IMPROVED POLYAMIDES FOR BINDING IN THE MINOR
GROOVE OF DOUBLE STRANDED DNA

The U.S. Government has certain rights in this invention pursuant to Grant Nos. GM
26453, 27681 and 47530 awarded by the National Institutes of Health.

CROSS REFERENCE TO RELATED APPLICATIONS 

This application is a continuation-in-part of PCT/US97/03332 filed February 20, 1997,
Serial No. 08/853,522 filed May 8, 1997 and PCT/US97/12722 filed July 21, 1997, which are
continuation-in-part applications of Serial No. 08/837,524, filed April 21, 1997, Serial No.
08/607,078, filed February 26, 1996, provisional application Serial No. 60/042,022, filed April
16, 1997 and provisional application Serial No. 60/043,444, filed April 8, 1997.


BACKGROUND OF THE INVENTION 

Field of the Invention 

This invention relates to polyamides which bind to predetermined sequences in the
minor groove of double stranded DNA.

Description of the Related Art 

The design of synthetic ligands that read the information stored in the DNA double helix
has been a long-standing goal of chemistry. Cell-permeable small molecules which target
predetermined DNA sequences are useful for the regulation of gene expression.
Oligodeoxynucleotides that recognize the major groove of double-helical DNA via triple-helix
formation bind to a broad range of sequences with high affinity and specificity. Although
oligonucleotides and their analogs have been shown to interfere with gene expression, the triple
helix approach is limited to purine tracks and suffers from poor cellular uptake. The
development of pairing rules for minor groove binding polyamides derived from
N-methylpyrrole (Py) and N-methylimidazole (Im) amino acids provides another code to control
sequence specificity. An Im/Py pair distinguishes G•C from C•G and both of these from A•T
or T•A base pairs. Wade, W.S., Mrksich, M. & Dervan, P.B. describe the design of peptides
that bind in the minor groove of DNA at 5'-(A/T)G(A/T)C(A/T)-3' sequences by a dimeric
side-by-side motif. J. Am. Chem. Soc. 114, 8783-8794 (1992); Mrksich, M. et al. describe an
antiparallel side-by-side motif for sequence-specific recognition in the minor groove of DNA by
the designed peptide 1-methylimidazole-2-carboxamide netropsin. Proc. Natl. Acad. Sci. USA
89, 7586-7590 (1992); Trauger, J.W., Baird, E.E. & Dervan, P.B. describe the recognition of
DNA by designed ligands at subnanomolar concentrations. Nature 382, 559-561 (1996). A



WO 98/37066 



PCT/US98/01006 



Py/Py pair specifies A•T from G•C but does not distinguish A•T from T•A. Pelton, J.G. &
Wemmer, D.E. describe the structural characterization of a 2:1 distamycin
A•d(CGCAAATTTGGC) complex by two-dimensional NMR. Proc. Natl. Acad. Sci. USA 86,
5723-5727 (1989); White, S., Baird, E.E. & Dervan, P.B. describe the effects of the A•T/T•A
degeneracy of pyrrole-imidazole polyamide recognition in the minor groove of DNA.
Biochemistry 35, 6147-6152 (1996); White, S., Baird, E.E. & Dervan, P.B. describe the
pairing rules for recognition in the minor groove of DNA by pyrrole-imidazole polyamides.
Chem. & Biol. 4, 569-578 (1997); White, S., Baird, E.E. & Dervan, P.B. describe the 5'-3'
N-C orientation preference for polyamide binding in the minor groove. In order to break this
degeneracy, a new aromatic amino acid, 3-hydroxy-N-methylpyrrole (Hp), incorporated into a
polyamide and paired opposite Py, has been found to discriminate A•T from T•A. The
replacement of a single hydrogen atom on the pyrrole with a hydroxy group in a Hp/Py pair
regulates affinity and specificity of a polyamide by an order of magnitude. Utilizing Hp
together with Py and Im in polyamides to form four aromatic amino acid pairs (Im/Py, Py/Im,
Hp/Py, and Py/Hp) provides a code to distinguish all four Watson-Crick base pairs in the minor
groove of DNA.
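
The four-pair code described above can be written as a small lookup table. The following Python sketch is illustrative only: the dictionary layout and the function name are assumptions introduced here, and the Py/Im assignment to C•G is taken from the established pairing rules rather than stated explicitly in this passage.

```python
# Pairing code as described in the text; names are hypothetical.
PAIRING_CODE = {
    ("Im", "Py"): "G•C",
    ("Py", "Im"): "C•G",
    ("Hp", "Py"): "T•A",
    ("Py", "Hp"): "A•T",
}


def recognized_base_pair(first: str, second: str) -> str:
    """Return the base pair targeted by the carboxamide pair first/second."""
    if (first, second) == ("Py", "Py"):
        # Py/Py is degenerate: it does not distinguish A•T from T•A.
        return "A•T or T•A"
    return PAIRING_CODE[(first, second)]


for pair in [("Im", "Py"), ("Py", "Im"), ("Hp", "Py"), ("Py", "Hp"), ("Py", "Py")]:
    print("/".join(pair), "->", recognized_base_pair(*pair))
```

The Py/Py case is handled separately to make the degeneracy that Hp is meant to break explicit.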

SUMMARY OF THE INVENTION

The invention encompasses improved polyamides for binding to the minor groove of
double stranded ("duplex") DNA. The polyamides are in the form of a hairpin comprising two
groups of at least three consecutive carboxamide residues, the two groups covalently linked by
an aliphatic amino acid residue, preferably γ-aminobutyric acid or 2,4-diaminobutyric acid, the
consecutive carboxamide residues of the first group pairing in an antiparallel manner with the
consecutive carboxamide residues of the second group in the minor groove of double stranded
DNA. The improvement relates to the inclusion of a binding pair of Hp/Py carboxamides in the
polyamide to bind to a T•A base pair in the minor groove of double stranded DNA or a Py/Hp
carboxamide binding pair in the polyamide to bind to an A•T base pair in the minor groove of
double stranded DNA. The improved polyamides have at least three consecutive carboxamide
pairs for binding to at least three DNA base pairs in the minor groove of a duplex DNA
sequence that has at least one A•T or T•A DNA base pair, the improvement comprising
selecting a Hp/Py carboxamide pair to correspond to a T•A base pair in the minor groove or a
Py/Hp carboxamide pair to bind to an A•T DNA base pair in the minor groove. Preferably the
binding of the carboxamide pairs to the DNA base pairs modulates the expression of a gene.


In one preferred embodiment, the polyamide includes at least four consecutive
carboxamide pairs for binding to at least four base pairs in a duplex DNA sequence. In another
preferred embodiment, the polyamide includes at least five consecutive carboxamide pairs for
binding to at least five base pairs in a duplex DNA sequence. In yet another preferred
embodiment, the polyamide includes at least six consecutive carboxamide pairs for binding to at
least six base pairs in a duplex DNA sequence. In one preferred embodiment, the improved
polyamides have four carboxamide binding pairs that will distinguish A•T, T•A, C•G and G•C
base pairs in the minor groove of a duplex DNA sequence. The duplex DNA sequence can be a
regulatory sequence, such as a promoter sequence or an enhancer sequence, or a gene sequence,
such as a coding sequence or a non-coding sequence. Preferably, the duplex DNA sequence is a
promoter sequence.



The preparation and the use of polyamides for binding in the minor groove of double
stranded DNA are extensively described in the art. This invention is an improvement of the
existing technology that uses 3-hydroxy-N-methylpyrrole to provide carboxamide binding pairs
for DNA binding polyamides.



The invention encompasses polyamides having γ-aminobutyric acid or a substituted
γ-aminobutyric acid to form a hairpin with a member of each carboxamide pairing on each side of
it. Preferably the substituted γ-aminobutyric acid is a chiral substituted γ-aminobutyric acid
such as (R)-2,4-diaminobutyric acid. In addition, the polyamides may contain an aliphatic
amino acid residue, preferably a β-alanine residue, in place of a non-Hp carboxamide. The
β-alanine residue is represented in formulas as β. The β-alanine residue becomes a member of a
carboxamide binding pair. The invention further includes the substitution of a β/β binding pair
for a non-Hp containing binding pair. Thus, binding pairs in addition to the Hp/Py and Py/Hp are
Im/β, β/Im, Py/β, β/Py, and β/β.

The polyamides of the invention can have additional moieties attached covalently to the
polyamide. Preferably the additional moieties are attached as substituents at the amino terminus
of the polyamide, the carboxy terminus of the polyamide, or at a chiral (R)-2,4-diaminobutyric
acid residue. Suitable additional moieties include a detectable labeling group such as a dye,
biotin or a hapten. Other suitable additional moieties are DNA reactive moieties that provide
for sequence specific cleavage of the duplex DNA.


Brief Description of the Drawings 

Figure 1 illustrates the structure of polyamides 1, 2, and 3.
Figure 2 illustrates the pairing of polyamides to DNA base pairs.
Figure 3 illustrates the DNase footprint titration of compounds 2 and 3.
Figure 4 illustrates a list of the structures of representative Hp containing polyamides.
Figure 5 illustrates the synthesis of a protected Hp monomer for solid phase synthesis.
Figure 6 illustrates the solid phase synthesis of polyamide 2.
Figure 7 illustrates the 1H-NMR characterization of polyamide 2.
Figure 8 illustrates the mass spectral characterization of polyamide 2.
Figure 9 illustrates 1H-NMR characterization of synthesis purity.
Figure 10 illustrates a DNase I footprint titration experiment.
Figure 11 illustrates the synthesis of a bifunctional conjugate of polyamide 2.
Figure 12 illustrates affinity cleaving evidence for oriented hairpin formation.
Figure 13 illustrates increased sequence specificity of Hp/Py containing polyamides.
Figure 14 illustrates 8-ring hairpin polyamides which target 5'-WGTNNW-3' sites.
Figure 15 illustrates 8-ring hairpin polyamides which target 5'-WGANNW-3' sites.
Figure 16 illustrates 8-ring hairpin polyamides which target 5'-WGGNNW-3' sites.
Figure 17 illustrates 8-ring hairpin polyamides which target 5'-WGCNNW-3' sites.

DETAILED DESCRIPTION OF THE INVENTION 

Within this application, unless otherwise stated, definitions of the terms and illustration
of the techniques of this application may be found in any of several well-known references such
as: Sambrook, J., et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor
Laboratory Press (1989); Goeddel, D., ed., Gene Expression Technology, Methods in
Enzymology, 185, Academic Press, San Diego, CA (1991); "Guide to Protein Purification" in
Deutscher, M.P., ed., Methods in Enzymology, Academic Press, San Diego, CA (1989); Innis, et
al., PCR Protocols: A Guide to Methods and Applications, Academic Press, San Diego, CA
(1990); Freshney, R.I., Culture of Animal Cells: A Manual of Basic Technique, 2nd Ed., Alan
Liss, Inc. New York, NY (1987); Murray, E.J., ed., Gene Transfer and Expression Protocols,
pp. 109-128, The Humana Press Inc., Clifton, NJ and Lewin, B., Genes VI, Oxford University
Press, New York (1997).

For the purposes of this application, a promoter is a regulatory sequence of DNA that is
involved in the binding of RNA polymerase to initiate transcription of a gene. A gene is a
segment of DNA involved in producing a peptide, polypeptide or protein, including the coding
region, non-coding regions preceding ("leader") and following ("trailer") the coding region, as
well as intervening non-coding sequences ("introns") between individual coding segments
("exons"). Coding refers to the representation of amino acids, start and stop signals in a three
base "triplet" code. Promoters are often upstream ("5' to") the transcription initiation site of
the corresponding gene. Other regulatory sequences of DNA in addition to promoters are
known, including sequences involved with the binding of transcription factors, including
response elements that are the DNA sequences bound by inducible factors. Enhancers comprise
yet another group of regulatory sequences of DNA that can increase the utilization of
promoters, and can function in either orientation (5'-3' or 3'-5') and in any location (upstream
or downstream) relative to the promoter. Preferably, the regulatory sequence has a positive
activity, i.e., binding of an endogenous ligand (e.g. a transcription factor) to the regulatory
sequence increases transcription, thereby resulting in increased expression of the corresponding
target gene. In such a case, interference with transcription by binding a polyamide to a
regulatory sequence would reduce or abolish expression of a gene.






The promoter may also include or be adjacent to a regulatory sequence known in the art
as a silencer. A silencer sequence generally has a negative regulatory effect on expression of
the gene. In such a case, expression of a gene may be increased directly by using a polyamide
to prevent binding of a factor to a silencer regulatory sequence or indirectly, by using a
polyamide to block transcription of a factor that binds to a silencer regulatory sequence.



It is to be understood that the polyamides of this invention bind to double stranded DNA
in a sequence specific manner. The function of a segment of DNA of a given sequence, such as
5'-TATAAA-3', depends on its position relative to other functional regions in the DNA
sequence. In this case, if the sequence 5'-TATAAA-3' on the coding strand of DNA is
positioned about 30 base pairs upstream of the transcription start site, the sequence forms part
of the promoter region (Lewin, Genes VI pp. 831-835). On the other hand, if the sequence
5'-TATAAA-3' is downstream of the transcription start site in a coding region and in proper
register with the reading frame, the sequence encodes the tyrosyl and lysyl amino acid residues
(Lewin, Genes VI pp. 213-215).
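
The coding-region reading of 5'-TATAAA-3' can be checked with a short Python sketch. It assumes the standard genetic code and includes only the two codons needed here; the table and function names are hypothetical.

```python
# Subset of the standard genetic code, written as coding-strand DNA triplets.
CODON_TABLE_SUBSET = {"TAT": "Tyr", "AAA": "Lys"}


def translate_in_frame(coding_strand: str) -> list[str]:
    """Split a coding-strand sequence into triplets and translate each one."""
    codons = [coding_strand[i:i + 3] for i in range(0, len(coding_strand) - 2, 3)]
    return [CODON_TABLE_SUBSET[codon] for codon in codons]


print(translate_in_frame("TATAAA"))  # ['Tyr', 'Lys']
```

The same six base pairs therefore act as a promoter element or as a Tyr-Lys coding segment depending only on where they sit relative to the transcription start site and reading frame.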


While not being held to one hypothesis, it is believed that the binding of the polyamides
of this invention modulates gene expression by altering the binding of DNA binding proteins,
such as RNA polymerase, transcription factors, TBF, TFIIEB and other proteins. The effect on
gene expression of polyamide binding to a segment of double stranded DNA is believed to be
related to the function, e.g., promoter, of that segment of DNA.



It is to be understood by one skilled in the art that the improved polyamides of the 
present invention may bind to any of the above-described DNA sequences or any other 
sequence having a desired effect upon expression of a gene. In addition, U.S. Patent No. 
5,578,444 describes numerous promoter targeting sequences from which base pair sequences
for targeting an improved polyamide of the present invention may be identified. 

It is generally understood by those skilled in the art that the basic structure of DNA in a
living cell includes both a major and a minor groove. For the purposes of describing the present
invention, the minor groove is the narrow groove of DNA as illustrated in common molecular
biology references such as Lewin, B., Genes VI, Oxford University Press, New York (1997).

To affect gene expression in a cell, which may include causing an increase or a decrease
in gene expression, an effective quantity of one or more polyamides is contacted with the cell and
internalized by the cell. The cell may be contacted in vivo or in vitro. Effective extracellular
concentrations of polyamides that can modulate gene expression range from about 10
nanomolar to about 1 micromolar. Gottesfeld, J.M., et al., Nature 387, 202-205 (1997). To
determine effective amounts and concentrations of polyamides in vitro, a suitable number of
cells is plated on tissue culture plates and various quantities of one or more polyamides are
added to separate wells. Gene expression following exposure to a polyamide can be monitored
in the cells or medium by detecting the amount of the protein gene product present as
determined by various techniques utilizing specific antibodies, including ELISA and western
blot. Alternatively, gene expression following exposure to a polyamide can be monitored by
detecting the amount of messenger RNA present as determined by various techniques, including
northern blot and RT-PCR.

Similarly, to determine effective amounts and concentrations of polyamides for in vivo
administration, a sample of body tissue or fluid, such as plasma, blood, urine, cerebrospinal
fluid, saliva, or biopsy of skin, muscle, liver, brain or other appropriate tissue source is
analyzed. Gene expression following exposure to a polyamide can be monitored by detecting
the amount of the protein gene product present as determined by various techniques utilizing
specific antibodies, including ELISA and western blot. Alternatively, gene expression
following exposure to a polyamide can be monitored by detecting the amount of messenger
RNA present as determined by various techniques, including northern blot and RT-PCR.

The polyamides of this invention may be formulated into diagnostic and therapeutic
compositions for in vivo or in vitro use. Representative methods of formulation may be found
in Remington: The Science and Practice of Pharmacy, 19th ed., Mack Publishing Co., Easton,
PA (1995).

For in vivo use, the polyamides may be incorporated into a physiologically acceptable
pharmaceutical composition that is administered to a patient in need of treatment or an animal
for medical or research purposes. The polyamide composition comprises pharmaceutically
acceptable carriers, excipients, adjuvants, stabilizers, and vehicles. The composition may be in
solid, liquid, gel, or aerosol form. The polyamide composition of the present invention may be
administered in various dosage forms orally, parenterally, by inhalation spray, rectally, or
topically. The term parenteral as used herein includes subcutaneous, intravenous,
intramuscular, intrasternal, or intraperitoneal injection or infusion techniques.

The selection of the precise concentration, composition, and delivery regimen is 
influenced by, inter alia, the specific pharmacological properties of the particular selected 
compound, the intended use, the nature and severity of the condition being treated or diagnosed, 
the age, weight, gender, physical condition and mental acuity of the intended recipient as well 
as the route of administration. Such considerations are within the purview of the skilled artisan. 
Thus, the dosage regimen may vary widely, but can be determined routinely using standard 
methods. 



Polyamides of the present invention are also useful for detecting the presence of double
stranded DNA of a specific sequence for diagnostic or preparative purposes. The sample
containing the double stranded DNA can be contacted by polyamide linked to a solid substrate,
thereby isolating DNA comprising a desired sequence. Alternatively, polyamides linked to a
suitable detectable marker, such as biotin, a hapten, a radioisotope or a dye molecule, can be
contacted by a sample containing double stranded DNA.

The design of bifunctional sequence specific DNA binding molecules requires the
integration of two separate entities: recognition and functional activity. Polyamides that
specifically bind with subnanomolar affinity to the minor groove of a predetermined sequence
of double stranded DNA are linked to a functional molecule, providing the corresponding
bifunctional conjugates useful in molecular biology, genomic sequencing, and human medicine.
Polyamides of this invention can be conjugated to a variety of functional molecules, which can
be independently chosen from but are not limited to arylboronic acids, biotins, polyhistidines
comprising from about 2 to 8 amino acids, haptens to which an antibody binds, solid phase
supports, oligodeoxynucleotides, N-ethylnitrosourea, fluorescein, bromoacetamide,
iodoacetamide, DL-α-lipoic acid, acridine, camptothecin, pyrene, mitomycin, Texas red,
anthracene, anthranilic acid, avidin, DAPI, isosulfan blue, malachite green, psoralen, ethyl red,
4-(psoralen-8-yloxy)-butyrate, tartaric acid, (+)-α-tocopherol, psoralen, EDTA, methidium,
acridine, Ni(II)•Gly-Gly-His, TO, Dansyl, pyrene, N-bromoacetamide, and gold particles. Such
bifunctional polyamides are useful for DNA affinity capture, covalent DNA modification,
oxidative DNA cleavage, and DNA photocleavage. Such bifunctional polyamides are also useful for
DNA detection by providing a polyamide linked to a detectable label. Detailed instructions for
synthesis of such bifunctional polyamides can be found in copending U.S. provisional
application 60/043,444, the teachings of which are incorporated by reference.

DNA complexed to a labeled polyamide can then be determined using the appropriate 
detection system as is well known to one skilled in the art. For example, DNA associated with 
a polyamide linked to biotin can be detected by a streptavidin/alkaline phosphatase system.

The present invention also describes a diagnostic system, preferably in kit form, for 
assaying for the presence of the double stranded DNA sequence bound by the polyamide of this 
invention in a body sample, such as brain tissue, cell suspensions or tissue sections, or body fluid
samples such as CSF, blood, plasma or serum, where it is desirable to detect the presence, and 
preferably the amount, of the double stranded DNA sequence bound by the polyamide in the 
sample according to the diagnostic methods described herein. 

The diagnostic system includes, in an amount sufficient to perform at least one 
assay, a specific polyamide as a separately packaged reagent. Instructions for use of the 
packaged reagent(s) are also typically included. As used herein, the term "package" refers 
to a solid matrix or material such as glass, plastic (e.g., polyethylene, polypropylene or 
polycarbonate), paper, foil and the like capable of holding within fixed limits a polyamide of 
the present invention. Thus, for example, a package can be a glass vial used to contain 
milligram quantities of a contemplated polyamide or it can be a microtiter plate well to which
microgram quantities of a contemplated polyamide have been operatively affixed, i.e., linked
so as to be capable of being bound by the target DNA sequence. "Instructions for use" typically
include a tangible expression describing the reagent concentration or at least one assay method
parameter such as the relative amounts of reagent and sample to be admixed, maintenance time
periods for reagent or sample admixtures, temperature, buffer conditions and the like. A
diagnostic system of the present invention preferably also includes a detectable label and a
detecting or indicating means capable of signaling the binding of the contemplated polyamide
of the present invention to the target DNA sequence. As noted above, numerous detectable
labels, such as biotin, and detecting or indicating means, such as enzyme-linked (direct or
indirect) streptavidin, are well known in the art.



Figure 1 shows representative structures of polyamides: ImImPyPy-γ-ImPyPyPy-β-Dp
(1), ImImPyPy-γ-ImHpPyPy-β-Dp (2), and ImImHpPy-γ-ImPyPyPy-β-Dp (3). (Hp =
3-hydroxy-N-methylpyrrole, Im = N-methylimidazole, Py = N-methylpyrrole, β = β-alanine, γ =
γ-aminobutyric acid, Dp = dimethylaminopropylamide). Polyamides were synthesized by solid
phase methods using Boc-protected 3-methoxypyrrole, imidazole, and pyrrole aromatic amino
acids, cleaved from the support by aminolysis, deprotected with sodium thiophenoxide, and
purified by reversed phase HPLC. Baird, E. E. & Dervan, P. B. describe the solid phase
synthesis of polyamides containing imidazole and pyrrole amino acids. J. Am. Chem. Soc. 118,
6141-6146 (1996); also see PCT/US97/03332. The identity and purity of the polyamides
were verified by 1H NMR, analytical HPLC, and matrix-assisted laser-desorption ionization
time-of-flight mass spectrometry (MALDI-TOF MS, monoisotopic): 1, 1223.6 (1223.6
calculated); 2, 1239.6 (1239.6 calculated); 3, 1239.6 (1239.6 calculated).
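
As a quick consistency check on the reported masses, the difference between polyamide 1 and the Hp-containing polyamides 2 and 3 should correspond to one added oxygen atom (H replaced by OH). The Python sketch below is illustrative: the variable names are hypothetical, and the monoisotopic mass of oxygen-16 (about 15.9949 Da) is a standard value not quoted in the text.

```python
# Monoisotopic masses reported in the text for polyamides 1, 2 and 3.
REPORTED_MASS = {1: 1223.6, 2: 1239.6, 3: 1239.6}

# Monoisotopic mass of oxygen-16 in daltons (assumed standard value).
O_MONOISOTOPIC = 15.9949

for hp_polyamide in (2, 3):
    delta = REPORTED_MASS[hp_polyamide] - REPORTED_MASS[1]
    matches_one_oxygen = abs(delta - O_MONOISOTOPIC) < 0.05
    print(f"polyamide {hp_polyamide}: delta = {delta:.1f} Da, "
          f"one oxygen: {matches_one_oxygen}")
```

The 0.05 Da tolerance reflects the one-decimal precision of the reported values and is itself an assumption.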

Figure 2 illustrates binding models for polyamides 1-3 in complex with 5'-TGGTCA-3'
and 5'-TGGACA-3' (A•T and T•A in fourth position highlighted). Filled and unfilled circles
represent imidazole and pyrrole rings respectively; circles containing an H represent
3-hydroxypyrrole, the curved line connecting the polyamide subunits represents γ-aminobutyric
acid, the diamond represents β-alanine, and the + represents the positively charged
dimethylaminopropylamide tail group.
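
The binding models can be reproduced by folding each hairpin at the γ-aminobutyric acid turn and reading the resulting ring pairs from the N-terminus, aligned 5' to 3'. The Python sketch below is a simplified illustration, not the inventors' procedure: function names are hypothetical, "γ" and "β" stand for the turn and the β-alanine residue, "W" denotes A or T, and the flanking positions contacted by the turn and tail are ignored.

```python
import re

# Ring pair (top strand / bottom strand) -> base read on the 5'->3' strand.
PAIR_TO_BASE = {
    ("Im", "Py"): "G",
    ("Py", "Im"): "C",
    ("Hp", "Py"): "T",
    ("Py", "Hp"): "A",
    ("Py", "Py"): "W",  # degenerate: A or T
}


def split_rings(segment: str) -> list[str]:
    """Split a run such as 'ImImPyPy' into its two-letter ring codes."""
    return re.findall(r"Im|Py|Hp", segment)


def predicted_core(polyamide: str) -> str:
    """Predict the core site of a hairpin written as 'rings-γ-rings-β-Dp'."""
    first, rest = polyamide.split("-γ-")
    second = rest.split("-β-")[0]
    top = split_rings(first)
    bottom = split_rings(second)[::-1]  # antiparallel pairing
    if len(top) != len(bottom):
        raise ValueError("hairpin halves must contain the same number of rings")
    return "".join(PAIR_TO_BASE[pair] for pair in zip(top, bottom))


for name in ("ImImPyPy-γ-ImPyPyPy-β-Dp",
             "ImImPyPy-γ-ImHpPyPy-β-Dp",
             "ImImHpPy-γ-ImPyPyPy-β-Dp"):
    print(name, "->", predicted_core(name))
```

Under these assumptions polyamide 1 reads GGWC, polyamide 2 reads GGAC (the core of 5'-TGGACA-3'), and polyamide 3 reads GGTC (the core of 5'-TGGTCA-3').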



Figure 3 shows quantitative DNase I footprint titration experiments with polyamides 2
and 3 on the 3'-32P-labeled 250-bp pJK6 EcoRI/PvuII restriction fragment. Lane 1, intact DNA;
lanes 2-11, DNase I digestion products in the presence of 100, 50, 20, 10, 5, 2, 1, 0.5, 0.2, 0.1
nM polyamide, respectively; lane 12, DNase I digestion products in the absence of polyamide;
lane 13, adenine-specific chemical sequencing. Iverson, B. L. & Dervan, P. B. describe an
adenine-specific DNA chemical sequencing reaction. Methods Enzymol. 15, 7823-7830 (1987).






All reactions were done in a total volume of 400 μL. A polyamide stock solution or H2O was
added to an assay buffer containing radiolabeled restriction fragment, with the final solution
conditions of 10 mM Tris-HCl, 10 mM KCl, 10 mM MgCl2, 5 mM CaCl2, pH 7.0. Solutions
were allowed to equilibrate for 4-12 h at 22 °C before initiation of footprinting reactions.
Footprinting reactions, separation of cleavage products, and data analysis were carried out as
described. White, S., Baird, E. E. & Dervan, P. B. Effects of the A•T/T•A degeneracy of
pyrrole-imidazole polyamide recognition in the minor groove of DNA. Biochemistry 35, 6147-
6152 (1996).
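
To give a feel for how such a titration series samples a binding curve, the Python sketch below evaluates the fraction of sites occupied at each polyamide concentration used in Figure 3. It assumes a simple 1:1 binding isotherm with polyamide in excess over DNA; the data analysis actually used is the one in the cited reference, and the dissociation constant here is a hypothetical value chosen only for illustration.

```python
def fraction_bound(ligand_nM: float, kd_nM: float) -> float:
    """Fractional site occupancy for 1:1 binding: theta = [L] / (Kd + [L])."""
    return ligand_nM / (kd_nM + ligand_nM)


# Polyamide concentrations (nM) from lanes 2-11 of Figure 3.
TITRATION_NM = [100, 50, 20, 10, 5, 2, 1, 0.5, 0.2, 0.1]

# Hypothetical dissociation constant, NOT a value from the text.
HYPOTHETICAL_KD_NM = 1.0

for conc in TITRATION_NM:
    theta = fraction_bound(conc, HYPOTHETICAL_KD_NM)
    print(f"{conc:>6} nM  theta = {theta:.2f}")
```

A series spanning three orders of magnitude brackets the half-occupancy point for any dissociation constant between roughly 0.1 and 100 nM, which is why the footprint can be fitted across the whole gel.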



Figure 4 shows the structure and equilibrium dissociation constant for numerous
compounds of the present invention. Polyamides are shown in complex with their respective
match site. Filled and unfilled circles represent imidazole (Im) and pyrrole (Py) rings,
respectively; circles containing an H represent 3-hydroxypyrrole (Hp), the curved line
connecting the polyamide subunits represents γ-aminobutyric acid (γ), the diamond represents
β-alanine (β), and the + represents the positively charged dimethylaminopropylamide tail group
(Dp). The equilibrium dissociation constants are the average values obtained from three DNase
I footprint titration experiments. The standard deviation for each set is less than 15% of the
reported number. Assays were carried out in the presence of 10 mM Tris-HCl, 10 mM KCl, 10
mM MgCl2, and 5 mM CaCl2 at pH 7.0 and 22 °C.
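
The averaging described above can be sketched in Python as follows. The three replicate values are hypothetical placeholders rather than data from Figure 4, and the use of the sample standard deviation is an assumption, since the text does not say which estimator was applied.

```python
from statistics import mean, stdev


def summarize_kd(replicates_nM: list[float]) -> tuple[float, float]:
    """Return the mean dissociation constant and its relative standard deviation."""
    average = mean(replicates_nM)
    relative_sd = stdev(replicates_nM) / average  # sample standard deviation
    return average, relative_sd


# Hypothetical results of three footprint titrations (nM), for illustration.
HYPOTHETICAL_REPLICATES_NM = [0.45, 0.50, 0.55]

average, relative_sd = summarize_kd(HYPOTHETICAL_REPLICATES_NM)
print(f"Kd = {average:.2f} nM, relative SD = {relative_sd:.0%}")
print("within the 15% criterion:", relative_sd < 0.15)
```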


Figure 5 shows the synthetic scheme for 3-O-methyl-N-Boc protected pyrrole-2- 
carboxylate. The hydroxypyrrole monoester can be prepared in 0.5 kg quantity using published 
procedures on enlarged scale. 

Figure 6 shows the solid phase synthetic scheme for ImImPyPy-γ-ImHpPyPy-β-Dp
starting from commercially available Boc-β-Pam-Resin: (i) 80% TFA/DCM, 0.4 M PhSH; (ii)
Boc-Py-OBt, DIEA, DMF; (iii) 80% TFA/DCM, 0.4 M PhSH; (iv) Boc-Py-OBt, DIEA, DMF;
(v) 80% TFA/DCM, 0.4 M PhSH; (vi) Boc-3-OMe-Py-OH, HBTU, DMF, DIEA; (vii) 80%
TFA/DCM, 0.4 M PhSH; (viii) Boc-Im-OH, DCC, HOBt; (ix) 80% TFA/DCM, 0.4 M PhSH;
(x) Boc-γ-aminobutyric acid, DIEA, DMF; (xi) 80% TFA/DCM, 0.4 M PhSH; (xii)
Boc-Py-OBt, DIEA, DMF; (xiii) 80% TFA/DCM, 0.4 M PhSH; (xiv) Boc-Py-OBt, DMF, DIEA; (xv)
80% TFA/DCM, 0.4 M PhSH; (xvi) Boc-Im-OH, DCC, HOBt; (xvii) 80% TFA/DCM, 0.4 M
PhSH; (xviii) imidazole-2-carboxylic acid, HBTU, DIEA; (xix) dimethylaminopropylamine,
55 °C, 18 h. Purification by reversed phase HPLC provides ImImPyPy-γ-ImOpPyPy-β-Dp. (Op
= 3-methoxypyrrole). Treatment of the 3-methoxypyrrole polyamide with thiophenol, NaH,
DMF, at 100 °C for 120 min provides polyamide 2 after reverse phase HPLC purification.
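
The alternating deprotection and coupling steps of Figure 6 follow directly from the polyamide sequence read from the C-terminus, since solid phase synthesis starts at the resin-bound β-alanine. The Python sketch below derives that step list; it is an illustration under simplifying assumptions (function names are hypothetical, "Op" stands for the 3-methoxypyrrole residue, and coupling reagents are omitted).

```python
import re


def coupling_order(polyamide: str) -> list[str]:
    """Return residues in the order they appear on the resin, C-terminus first."""
    residues = re.findall(r"Im|Py|Hp|Op|γ|β", polyamide)
    return residues[::-1]


def synthesis_steps(polyamide: str) -> list[str]:
    """List deprotect/couple cycles, then the final aminolytic cleavage."""
    order = coupling_order(polyamide)
    steps = []
    for residue in order[1:]:  # order[0] is the β-alanine preloaded on the resin
        steps.append("deprotect Boc (80% TFA/DCM, 0.4 M PhSH)")
        steps.append(f"couple {residue}")
    steps.append("cleave from resin with dimethylaminopropylamine")
    return steps


for number, step in enumerate(synthesis_steps("ImImPyPy-γ-ImOpPyPy-β-Dp"), start=1):
    print(number, step)
```

The Dp tail is not treated as a coupled residue because it is introduced by the cleavage step; the resulting list has nineteen entries, matching steps (i) through (xix).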






Figure 7 shows the aromatic region from 7-11 ppm for the 1H-NMR spectrum 
determined at 300 MHz for ImImPyPy-γ-ImOpPyPy-β-Dp and ImImPyPy-γ-ImHpPyPy-β-Dp.
This region of the spectrum may be used to determine compound identity and purity. 

Figure 8 shows the MALDI-TOF mass spectrum determined in positive ion mode with a
monoisotopic detector for the polyamides ImImPyPy-γ-ImOpPyPy-β-Dp and
ImImPyPy-γ-ImHpPyPy-β-Dp. This spectrum may be used to determine compound identity and purity.

Figure 9 shows the methyl group region from 3.5-4.0 ppm for the 1H-NMR spectrum
determined at 300 MHz for ImPyPy-γ-OpPyPy-β-Dp and ImPyPy-γ-HpPyPy-β-Dp. This region
of the spectrum may be used to directly follow the progress of the conversion of 3-methoxypyrrole
to 3-hydroxypyrrole.



Figure 10 shows quantitative DNase I footprint titration experiments with the polyamides
ImPyPy-γ-PyHpPy-β-Dp and ImHpPy-γ-PyPyPy-β-Dp on the 3'-32P-labeled 370-bp pDEH1
EcoRI/PvuII restriction fragment. Intact lane, labeled restriction fragment, no polyamide or
DNase I added; lanes 1-10, DNase I digestion products in the presence of 10 μM, 5 μM, 2 μM,
1 μM, 500 nM, 200 nM, 100 nM, 50 nM, 20 nM, 10 nM ImPyPy-γ-PyPyPy-β-Dp, respectively,
or 1 μM, 500 nM, 200 nM, 100 nM, 50 nM, 20 nM, 10 nM, 5 nM, 2 nM, 1 nM
ImHpPy-γ-PyPyPy-β-Dp, respectively; DNase I lane, DNase I digestion products in the absence of
polyamide; A lane, adenine-specific chemical sequencing. Iverson, B. L. & Dervan, P. B.
describe an adenine-specific DNA chemical sequencing reaction. Methods Enzymol. 15, 7823-
7830 (1987). All reactions were done in a total volume of 40 μL. A polyamide stock solution
or H2O was added to an assay buffer containing radiolabeled restriction fragment, with the final
solution conditions of 10 mM Tris-HCl, 10 mM KCl, 10 mM MgCl2, 5 mM CaCl2, pH 7.0.
Solutions were allowed to equilibrate for 4-12 h at 22 °C before initiation of footprinting
reactions. Footprinting reactions, separation of cleavage products, and data analysis were
carried out as described. White, S., Baird, E. E. & Dervan, P. B. describe the pairing rules for
recognition in the minor groove of DNA by pyrrole-imidazole polyamides. Chemistry &
Biology 4, 569-578 (1997).



Figure 11 shows the synthesis of a bifunctional polyamide which incorporates the Hp/Py
pair. Treatment of a sample of ImImPyPy-γ-ImHpPyPy-β-Pam-resin (see Figure 6) with
3,3'-diamino-N-methyldipropylamine, 55 °C, 18 h followed by reverse phase HPLC purification
provides the Op polyamide with a free primary amine group which can be coupled to an
activated carboxylic acid derivative. Treatment with (i) EDTA-dianhydride, DMSO/NMP,
DIEA, 55 °C; (ii) 0.1 M NaOH, followed by reverse phase HPLC purification provides the
Op-Py-Im-polyamide-EDTA conjugate. Treatment of the 3-methoxypyrrole polyamide with
thiophenol, NaH, DMF, at 100 °C for 120 min provides polyamide 2 after reverse phase HPLC
purification.

Figure 12 shows the determination of the binding orientation of hairpin polyamides
ImImPyPy-γ-ImHpPyPy-β-Dp-EDTA•Fe(II) (2-E•Fe(II)) and ImImHpPy-γ-ImPyPyPy-β-Dp-
EDTA•Fe(II) (3-E•Fe(II)) by affinity cleaving footprint titration. Top and bottom left: Affinity
cleavage experiments on a 3'-32P-labeled 250-bp pJK6 EcoRI/PvuII restriction fragment. The
5'-TGGACA-3' and 5'-TGGTCA-3' sites are shown on the right side of the autoradiogram.
Top left: lane 1, adenine-specific chemical sequencing reaction; lanes 2-6, 6.5 μM, 1.0 μM,
100 nM, 10 nM, 1 nM polyamide 2-E•Fe(II); lane 7, intact restriction fragment, no polyamide
added. Bottom left: lane 1, A reaction; lanes 2-6, 8.5 μM, 1.0 μM, 100 nM, 10 nM, 1 nM
polyamide 3-E•Fe(II); lane 7, intact DNA. All reactions were carried out in a total volume of
40 μL. A stock solution of polyamide or H2O was added to a solution containing 20 kcpm
labeled restriction fragment, affording final solution conditions of 25 mM Tris-Acetate, 20 mM
NaCl, 100 μM/bp calf thymus DNA, at pH 7.0. Solutions were allowed to equilibrate for a
minimum of 4 h at 22 °C before initiation of reactions. Affinity cleavage reactions were carried
out as described. White, S., Baird, E.E. & Dervan, P.B. Effects of the A•T/T•A degeneracy of
pyrrole-imidazole polyamide recognition in the minor groove of DNA. Biochemistry 35, 6147-
6152 (1996). Top and bottom right: Affinity cleavage patterns of 2-E•Fe(II) and 3-E•Fe(II) at
100 nM bound to 5'-TGGACA-3' and 5'-TGGTCA-3'. Bar heights are proportional to the
relative cleavage intensities at each base pair. Shaded and nonshaded circles denote imidazole
and pyrrole carboxamides, respectively. Nonshaded diamonds represent the β-alanine moiety.
A curved line represents the γ-aminobutyric acid, and the + represents the positively charged
dimethylaminopropylamide tail group. The boxed Fe denotes the EDTA•Fe(II) cleavage
moiety.

Figure 13 shows quantitative DNase I footprint titration experiments with the
polyamides ImPyPyPyPy-γ-ImPyPyPyPy-β-Dp and ImHpPyPyPy-γ-ImHpPyPyPy-β-Dp on the
3'-32P-labeled 252-bp pJK7 EcoRI/Pvu II restriction fragment. For the
ImPyPyPyPy-γ-ImPyPyPyPy-β-Dp gel (left): lane 1, DNase I digestion products in the absence of polyamide;
lanes 2-18, DNase I digestion products in the presence of 1.0 µM, 500 nM, 200, 100, 65, 40, 25,
15, 10, 6.5, 4.0, 2.5, 1.5, 1.0, 0.5, 0.2, 0.1 nM polyamide; lane 19, DNase I digestion products in
the absence of polyamide; lane 20, intact restriction fragment; lane 21, guanine-specific
chemical sequencing reaction; lane 22, adenine-specific chemical sequencing reaction. For the
ImHpPyPyPy-γ-ImHpPyPyPy-β-Dp gel (right): lane 1, intact DNA; lane 2, DNase I digestion
products in the absence of polyamide; lanes 3-19, 1.0 µM, 500 nM, 200, 100, 50, 20, 10, 5, 2, 1,
0.5, 0.2, 0.1, 0.05, 0.01, 0.005, 0.001 nM polyamide; lane 20, DNase I digestion products in the
absence of polyamide; lane 21, A reaction. All reactions were done in a total volume of 400
µL. A polyamide stock solution or H2O was added to an assay buffer containing radiolabeled
restriction fragment, with the final solution conditions of 10 mM Tris-HCl, 10 mM KCl, 10 mM
MgCl2, 5 mM CaCl2, pH 7.0. Solutions were allowed to equilibrate for 4-12 h at 22 °C before
initiation of footprinting reactions. Footprinting reactions, separation of cleavage products, and
data analysis were carried out as described. White, S., Baird, E.E. & Dervan, P.B. Effects of the
A·T/T·A degeneracy of pyrrole-imidazole polyamide recognition in the minor groove of DNA.
Biochemistry 35, 6147-6152 (1996).

Fig. 14 shows the 8-ring Hp-Py-Im-polyamide hairpins described by the pairing code of
the present invention. The eight-ring hairpin template is shown at the top. The polyamide has
the formula X1X2X3X4-γ-X5X6X7X8, wherein γ is the -NH-CH2-CH2-CH2-CONH- hairpin
linkage derived from γ-aminobutyric acid or a chiral hairpin linkage derived from
R-2,4-diaminobutyric acid; X4/X5, X3/X6, X2/X7, and X1/X8 represent carboxamide binding pairs
which bind the DNA base pairs. The minor groove sequence to be bound is represented as
5'-WGTNNW-3', where the 5'-GTNN-3' core sequence is defined as position a, b, c, and d (W =
A or T, N = A, G, C, or T). A linear sequence of aromatic amino acids fills the hairpin template
in order to satisfy the ring pairing requirements to correspond to the DNA base pairs in the
minor groove to be bound. The ring pairing code as applied is listed in Table 2. The 16 unique
hairpin polyamides which target 16 5'-WGTNNW-3' sequences are drawn as binding models
where filled and unfilled circles represent imidazole and pyrrole rings respectively; circles
containing an H represent 3-hydroxypyrrole, and the curved line connecting the polyamide
subunits represents γ-aminobutyric acid.



Fig. 15 shows the 8-ring Hp-Py-Im-polyamide hairpins described by the pairing code of
the present invention. The eight-ring hairpin template is shown at the top. The polyamide has
the formula X1X2X3X4-γ-X5X6X7X8, wherein γ is the -NH-CH2-CH2-CH2-CONH- hairpin
linkage derived from γ-aminobutyric acid or a chiral hairpin linkage derived from
R-2,4-diaminobutyric acid; X4/X5, X3/X6, X2/X7, and X1/X8 represent carboxamide binding pairs
which bind the DNA base pairs. The minor groove sequence to be bound is represented as
5'-WGANNW-3', where the 5'-GANN-3' core sequence is defined as position a, b, c, and d (W =
A or T, N = A, G, C, or T). A linear sequence of aromatic amino acids fills the hairpin template
in order to satisfy the ring pairing requirements to correspond to the DNA base pairs in the
minor groove to be bound. The ring pairing code as applied is listed in Table 2. The 16 unique
hairpin polyamides which target 16 5'-WGANNW-3' sequences are drawn as binding models
where filled and unfilled circles represent imidazole and pyrrole rings respectively; circles
containing an H represent 3-hydroxypyrrole, and the curved line connecting the polyamide
subunits represents γ-aminobutyric acid.

Fig. 16 shows the 8-ring Hp-Py-Im-polyamide hairpins described by the pairing code of
the present invention. The eight-ring hairpin template is shown at the top. The polyamide has
the formula X1X2X3X4-γ-X5X6X7X8, wherein γ is the -NH-CH2-CH2-CH2-CONH- hairpin
linkage derived from γ-aminobutyric acid or a chiral hairpin linkage derived from
R-2,4-diaminobutyric acid; X4/X5, X3/X6, X2/X7, and X1/X8 represent carboxamide binding pairs
which bind the DNA base pairs. The minor groove sequence to be bound is represented as
5'-WGGNNW-3', where the 5'-GGNN-3' core sequence is defined as position a, b, c, and d (W =
A or T, N = A, G, C, or T). A linear sequence of aromatic amino acids fills the hairpin template
in order to satisfy the ring pairing requirements to correspond to the DNA base pairs in the
minor groove to be bound. The ring pairing code as applied is listed in Table 2. The 16 unique
hairpin polyamides which target 16 5'-WGGNNW-3' sequences are drawn as binding models
where filled and unfilled circles represent imidazole and pyrrole rings respectively; circles
containing an H represent 3-hydroxypyrrole, and the curved line connecting the polyamide
subunits represents γ-aminobutyric acid.



Fig. 17 shows the 8-ring Hp-Py-Im-polyamide hairpins described by the pairing code of
the present invention. The eight-ring hairpin template is shown at the top. The polyamide has
the formula X1X2X3X4-γ-X5X6X7X8, wherein γ is the -NH-CH2-CH2-CH2-CONH- hairpin
linkage derived from γ-aminobutyric acid or a chiral hairpin linkage derived from
R-2,4-diaminobutyric acid; X4/X5, X3/X6, X2/X7, and X1/X8 represent carboxamide binding pairs
which bind the DNA base pairs. The minor groove sequence to be bound is represented as
5'-WGCNNW-3', where the 5'-GCNN-3' core sequence is defined as position a, b, c, and d (W =
A or T, N = A, G, C, or T). A linear sequence of aromatic amino acids fills the hairpin template
in order to satisfy the ring pairing requirements to correspond to the DNA base pairs in the
minor groove to be bound. The ring pairing code as applied is listed in Table 2. The 16 unique
hairpin polyamides which target 16 5'-WGCNNW-3' sequences are drawn as binding models
where filled and unfilled circles represent imidazole and pyrrole rings respectively; circles
containing an H represent 3-hydroxypyrrole, and the curved line connecting the polyamide
subunits represents γ-aminobutyric acid.



Four-ring polyamide subunits, covalently coupled to form eight-ring hairpin structures,
bind specifically to 6-bp target sequences at subnanomolar concentrations. Trauger, J.W.,
Baird, E.E. & Dervan, P.B. describe the recognition of DNA by designed ligands at
subnanomolar concentrations. Nature 382, 559-561 (1996); Swalley, S.E., Baird, E.E. &
Dervan, P.B. describe the discrimination of 5'-GGGG-3', 5'-GCGC-3', and 5'-GGCC-3'
sequences in the minor groove of DNA by eight-ring hairpin polyamides. J. Am. Chem. Soc.
119, 6953-6961 (1997). The DNA-binding affinities of three eight-ring hairpin polyamides,
shown in Figure 1 as compounds 1, 2, and 3, containing pairings of Im/Py and Py/Im opposite G·C
and C·G and either Py/Py, Hp/Py, or Py/Hp at a common single point opposite T·A and A·T have
been determined. Equilibrium dissociation constants (Kd) for ImImPyPy-γ-ImPyPyPy-β-Dp 1,
ImImPyPy-γ-ImHpPyPy-β-Dp 2, and ImImHpPy-γ-ImPyPyPy-β-Dp 3 of Figure 1 are shown in
Table 1. Brenowitz, M., Senear, D.F., Shea, M.A. & Ackers, G.K. describe a quantitative
DNase footprint titration method for studying protein-DNA interactions. Methods Enzymol.
130, 132-181 (1986). The values were determined by quantitative DNase I footprint
titration experiments on a 3'-32P-labeled 250-bp DNA fragment containing the target sites
5'-TGGACA-3' and 5'-TGGTCA-3', which differ by a single A·T base pair in the fourth position.
The DNase footprint gels are shown in Figure 3.

TABLE 1 Equilibrium dissociation constants*

Polyamide†      5'-TGGTCA-3'      5'-TGGACA-3'      Krel‡

1 (Py/Py)       Kd = 0.077 nM     Kd = 0.15 nM      2

2 (Py/Hp)       Kd = 15 nM        Kd = 0.83 nM      0.055

3 (Hp/Py)       Kd = 0.48 nM      Kd = 37 nM        77

*The reported dissociation constants are the average values obtained from three
DNase I footprint titration experiments. The standard deviation for each data set is
less than 15% of the reported number. Assays were carried out in the presence of 10
mM Tris-HCl, 10 mM KCl, 10 mM MgCl2, and 5 mM CaCl2 at pH 7.0 and 22 °C.
†Ring pairing opposite T·A and A·T in the fourth position.
‡Calculated as Kd(5'-TGGACA-3')/Kd(5'-TGGTCA-3').

Based on the pairing rules for polyamide-DNA complexes, both of these sequences are a
match for control polyamide 1, which places a Py/Py pairing opposite A·T and T·A at both
sites. It was determined that polyamide 1 (Py/Py) binds to 5'-TGGTCA-3' and
5'-TGGACA-3' within a factor of 2 (Kd = 0.077 and 0.15 nM respectively). In
contrast, polyamide 2 (Py/Hp) binds to 5'-TGGTCA-3' and 5'-TGGACA-3' with dissociation
constants which differ by a factor of 18 (Kd = 15 nM and 0.83 nM respectively). By reversing
the pairing in polyamide 3 (Hp/Py) the dissociation constants differ again, in the opposite
direction, by a factor of 77 (Kd = 0.48 nM and 37 nM respectively). Control experiments
performed on separate DNA fragments reveal that neither a 5'-TGGGCA-3' nor a
5'-TGGCCA-3' site is bound by polyamide 2 or 3 at concentrations < 100 nM, indicating that the Hp/Py and
Py/Hp ring pairings do not bind opposite G·C or C·G. The A·T vs. T·A discrimination is
achieved preferably when the two neighboring base pairs are G·C and C·G (GTC vs. GAC).

The specificity of polyamides 2 and 3 for sites which differ by a single A·T/T·A base
pair results from small chemical changes. Replacing the Py/Py pair in 1 with a Py/Hp pairing
as in 2, a single substitution of C3-OH for C3-H, destabilizes interaction with 5'-TGGTCA-3'
by 191-fold, a free energy difference of 3.1 kcal/mol. Interaction of 2 with 5'-TGGACA-3' is
destabilized only 6-fold relative to 1, a free energy difference of 1.1 kcal/mol. Similarly,
replacing the Py/Py pair in 1 with Hp/Py as in 3 destabilizes interaction with 5'-TGGACA-3'
by 252-fold, a free energy difference of 3.2 kcal/mol. Interaction of 3 with 5'-TGGTCA-3' is
destabilized only 6-fold relative to 1, a free energy difference of 1.0 kcal/mol.
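The free energy differences quoted above can be related to the fold-changes in binding through ΔΔG = RT ln(fold-change). The following is a minimal sketch, assuming a temperature of 22 °C (the assay temperature reported with the dissociation constants) and R = 1.987 cal/(mol·K); the function name is illustrative and not part of the original disclosure.

```python
import math

R_KCAL = 1.987e-3   # gas constant in kcal/(mol*K)
T_KELVIN = 295.15   # 22 degrees C, assumed from the reported assay conditions

def ddg_kcal_per_mol(fold_change: float) -> float:
    """Free energy difference corresponding to a ratio of dissociation constants."""
    return R_KCAL * T_KELVIN * math.log(fold_change)

# Fold-changes quoted in the text for the Py/Hp and Hp/Py substitutions.
for fold in (191, 6, 252):
    print(f"{fold}-fold -> {ddg_kcal_per_mol(fold):.1f} kcal/mol")
```

With these assumptions the 191-fold and 252-fold changes come out near 3.1 and 3.2 kcal/mol, and a 6-fold change near 1 kcal/mol, in line with the values stated in the text.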



The polyamides of this invention provide for coded targeting of predetermined DNA
sequences with affinity and specificity comparable to sequence-specific DNA binding proteins.
Hp, Im, and Py polyamides complete the minor groove recognition code using three aromatic
amino acids which combine to form four ring pairings (Im/Py, Py/Im, Hp/Py, and Py/Hp) which
complement the four Watson-Crick base pairs, as shown in TABLE 2. There are a possible 240
four base pair sequences which contain at least 1 A·T or T·A base pair and therefore can
advantageously use an Hp/Py or Py/Hp carboxamide binding. Polyamides binding to any of
these sequences can be designed in accordance with the code of TABLE 2.
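The figure of 240 follows from simple counting: of the 4^4 = 256 possible four base pair sequences, 2^4 = 16 contain only G·C or C·G base pairs. A short enumeration, written here purely as an illustrative check (the helper name is an assumption), confirms the count:

```python
from itertools import product

BASES = "ACGT"

def has_at_pair(seq: str) -> bool:
    """True if the sequence contains at least one A or T (an A.T or T.A base pair)."""
    return any(base in "AT" for base in seq)

four_mers = ["".join(p) for p in product(BASES, repeat=4)]
with_at = [s for s in four_mers if has_at_pair(s)]

print(len(four_mers), len(with_at))
```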



TABLE 2 Pairing code for minor groove recognition*

Pair      G·C    C·G    T·A    A·T

Im/Py      +      -      -      -

Py/Im      -      +      -      -

Hp/Py      -      -      +      -

Py/Hp      -      -      -      +

* favored (+), disfavored (-)
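As an illustration of how the pairing code can be applied to the eight-ring hairpin template X1X2X3X4-γ-X5X6X7X8 (pairs X1/X8, X2/X7, X3/X6, X4/X5), the following sketch maps a four base pair core sequence, read 5' to 3', to a ring sequence. It is a simplified, hypothetical helper: the function and variable names are assumptions, and the tail group and flanking W positions are ignored.

```python
# Favored ring pairing for each base on the 5'->3' target strand (per the pairing code).
PAIR_FOR_BASE = {
    "G": ("Im", "Py"),  # G.C
    "C": ("Py", "Im"),  # C.G
    "T": ("Hp", "Py"),  # T.A
    "A": ("Py", "Hp"),  # A.T
}
GAMMA = "\u03b3"  # hairpin turn derived from gamma-aminobutyric acid

def design_hairpin(core: str) -> str:
    """Return the ring sequence X1X2X3X4-gamma-X5X6X7X8 for a 4-bp core sequence."""
    pairs = [PAIR_FOR_BASE[base] for base in core.upper()]
    first_strand = [pair[0] for pair in pairs]             # X1..X4
    second_strand = [pair[1] for pair in reversed(pairs)]  # X5..X8 (X5 pairs with X4)
    return "".join(first_strand) + "-" + GAMMA + "-" + "".join(second_strand)

print(design_hairpin("GGTC"))
print(design_hairpin("GGAC"))
```

For the cores GGTC and GGAC this reproduces the ring sequences of polyamides 3 and 2, respectively, which is consistent with the site preferences reported for those compounds.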

For certain G·C rich sequences the affinity of polyamide-DNA complexes may be
enhanced by substitution of an Im/β pair for Im/Py at G·C and β/Im for Py/Im at C·G. At A·T
and T·A base pairs, a Py/β, β/Py, or β/β pair may be used. The alternate aliphatic/aromatic
amino acid pairing code is described in Table 3.



TABLE 3 Aliphatic/aromatic substitution for ring pairings*

Pair       Substitution

Im/Py      Im/β

Py/Im      β/Im

Hp/Py      Py/β, β/Py, Hp/β, β/β

Py/Hp      Py/β, β/Py, β/Hp, β/β



U. S. Patent 5,578,444 describes numerous promoter region targeting sequences from 
which base pair sequences for targeting a polyamide can be identified. 

PCT U.S. 97/003332 describes methods for synthesis of polyamides which are suitable
for preparing polyamides of this invention. The use of β-alanine in place of a pyrrole amino
acid in the synthetic methods provides aromatic/aliphatic pairing (Im/β, β/Im, Py/β, and β/Py)
and aliphatic/aliphatic pairing (β/β) substitution. The use of γ-aminobutyric acid, or a
substituted γ-aminobutyric acid such as (R)-2,4-diaminobutyric acid, provides for preferred
hairpin turns. The following examples illustrate the synthesis of polyamides of the present
invention.

EXAMPLE 1:

PREPARATION OF A PROTECTED Hp MONOMER FOR SOLID PHASE SYNTHESIS.

Distamycin and its analogs have previously been considered targets of traditional
multistep synthetic chemistry. Arcamone, F., Orezzi, P. G., Barbieri, W., Nicolella, V. & Penco,
S. describe a solution phase synthesis of distamycin. Gazz. Chim. Ital. 1967, 97, 1097. The
repeating amide of distamycin is formed from an aromatic carboxylic acid and an aromatic
amine. The aromatic acid is often unstable to decarboxylation, and the aromatic amines have
been found to be air and light sensitive. Lown, J. W. & Krowicki, K. describe a solution phase
synthesis of distamycin. J. Org. Chem. 1985, 50, 3774. The variable coupling yields, long
reaction times (often >24 h), numerous side products, and reactive intermediates (acid chlorides
and trichloro ketones) characteristic of the traditional solution phase coupling reactions make
the synthesis of the aromatic carboxamides problematic. B. Merrifield describes the solid phase
synthesis of a tetrapeptide. J. Am. Chem. Soc. 1963, 85, 2149. In order to implement an efficient
solid phase methodology for the synthesis of the pyrrole-imidazole polyamides, the following
components were developed: (1) a synthesis which provides large quantities of appropriately
protected monomer or dimer building blocks in high purity, (2) optimized protocols for forming
an amide in high yield from a support-bound aromatic amine and an aromatic carboxylic acid,
(3) methods for monitoring reactions on the solid support, and (4) a stable resin linkage agent
that can be cleaved in high yield upon completion of the synthesis. Baird, E. E. & Dervan, P. B.
describe the solid phase synthesis of polyamides containing imidazole and pyrrole amino
acids. J. Am. Chem. Soc. 118, 6141-6146 (1996); also see PCT US 97/003332. In order to
prepare polyamides which contain the 3-hydroxypyrrole monomer, a synthesis has been
developed which allows the appropriately protected Boc-Op acid monomer to be prepared on 50
g scale. 1H NMR and 13C NMR spectra were recorded on a General Electric-QE 300 NMR
spectrometer in CD3OD or DMSO-d6, with chemical shifts reported in parts per million relative
to residual CHD2OD or DMSO-d5, respectively. IR spectra were recorded on a Perkin-Elmer
FTIR spectrometer. High-resolution mass spectra were recorded using fast atom bombardment
(FABMS) techniques at the Mass Spectrometry Laboratory at the University of California,
Riverside. Reactions were executed under an inert argon atmosphere. Reagent grade chemicals
were used as received unless otherwise noted. Still, W. C., Kahn, M. & Mitra, A. describe flash
column chromatography. J. Org. Chem. 1978, 43, 2923-2925. Flash chromatography was
carried out using EM Science Kieselgel 60 (230-400 mesh). Thin-layer chromatography was
performed on EM Reagents silica gel plates (0.5 mm thickness). All compounds were
visualized with short-wave ultraviolet light.

Table 4: Intermediates for preparation of Boc-protected 3-methoxypyrrole

NAME

Ethyl 4-carboxy-3-hydroxy-1-methylpyrrole-2-carboxylate

Ethyl 4-[(benzyloxycarbonyl)amino]-3-hydroxy-1-methylpyrrole-2-carboxylate

Ethyl 4-[(benzyloxycarbonyl)amino]-3-methoxy-1-methylpyrrole-2-carboxylate

Ethyl 4-[(tert-butyloxycarbonyl)amino]-3-methoxy-1-methylpyrrole-2-carboxylate

4-[(tert-Butyloxycarbonyl)amino]-3-methoxy-1-methylpyrrole-2-carboxylic acid




Ethyl 4-[(benzyloxycarbonyl)amino]-3-hydroxy-1-methylpyrrole-2-carboxylate.
Ethyl 4-carboxy-3-hydroxy-1-methylpyrrole-2-carboxylate (60 g, 281.7 mmol) was dissolved in 282
mL acetonitrile. TEA (28.53 g, 282 mmol) was added, followed by diphenylphosphorylazide
(77.61 g, 282 mmol). The mixture was refluxed for 5 hours, followed by addition of benzyl
alcohol (270 ml), and reflux was continued overnight. The solution was cooled and volatiles
removed in vacuo. The residue was adsorbed onto silica and chromatographed, 4:1 hexanes :
ethyl acetate, to give a white solid (21.58 g, 24%). 1H NMR (DMSO-d6) δ 8.73 (s, 1H), 8.31 (s,
1H), 7.31 (m, 5H), 6.96 (s, 1H), 5.08 (s, 2H), 4.21 (q, 2H, J = 7.1 Hz), 3.66 (s, 3H), 1.25 (t, 3H,
J = 7.1 Hz); MS m/e 319.163 (M+H 319.122 calcd. for C16H18N2O5).
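The reported yield can be checked against the quantities given in this preparation. The sketch below is illustrative only: it assumes standard average atomic masses and treats the starting pyrrole acid (281.7 mmol) as the limiting reagent; the helper names are hypothetical.

```python
# Average atomic masses (g/mol) for the elements in the product formula.
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999}

def molar_mass(formula: dict) -> float:
    """Molar mass (g/mol) from a mapping of element symbol to atom count."""
    return sum(ATOMIC_MASS[element] * count for element, count in formula.items())

def percent_yield(product_g: float, formula: dict, limiting_mmol: float) -> float:
    """Isolated yield (%) relative to the limiting reagent."""
    product_mmol = 1000.0 * product_g / molar_mass(formula)
    return 100.0 * product_mmol / limiting_mmol

cbz_hydroxypyrrole_ester = {"C": 16, "H": 18, "N": 2, "O": 5}  # C16H18N2O5
print(round(percent_yield(21.58, cbz_hydroxypyrrole_ester, 281.7)))
```

Under these assumptions, 21.58 g of C16H18N2O5 from 281.7 mmol of starting material corresponds to the 24% yield stated above.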

Ethyl 4-[(tert-butoxycarbonyl)amino]-3-methoxy-1-methylpyrrole-2-carboxylate. Ethyl
4-[(benzyloxycarbonyl)amino]-3-hydroxy-1-methylpyrrole-2-carboxylate (13.4 g, 42.3 mmol)
was dissolved in 110 mL acetone. Anhydrous K2CO3 (11.67 g, 84.5 mmol) was added,
followed by methyl iodide (5.96 g, 42.3 mmol) and dimethylaminopyridine (0.5 g, 4.23 mmol),
and the mixture was stirred overnight. The solid K2CO3 was removed by filtration and 200 ml
water added. Volatiles were removed in vacuo and the solution was made acidic by addition of 1 N
H2SO4. The aqueous layer was extracted with diethyl ether. Organic layers were combined,
washed with 10% H2SO4, dried over MgSO4, and dried to give a white solid. The solid was
used without further purification and dissolved in 38 ml DMF. DIEA (11 ml), Boc anhydride
(9.23 g, 42.3 mmol), and 10% Pd/C (500 mg) were added and the solution stirred under
hydrogen (1 atm) for 2.1 h. The slurry was filtered through celite, which was washed with
methanol. Water (250 ml) was added and volatiles removed in vacuo. The aqueous layer was
extracted with ether. Organic layers were combined, washed with water and brine, and dried
over MgSO4. Solvent was removed in vacuo to give a white solid (8.94 g, 71%). 1H NMR
(DMSO-d6) δ 8.43 (s, 1H), 7.03 (s, 1H), 4.19 (q, 2H, J = 7.1 Hz), 3.70 (s, 3H), 3.67 (s, 3H),
1.42 (s, 9H), 1.26 (t, 3H, J = 7.1 Hz); MS m/e 299.161 (M+H 299.153 calcd. for C14H22N2O5).

4-[(tert-Butoxycarbonyl)amino]-3-methoxy-1-methylpyrrole-2-carboxylic acid. Ethyl
4-[(tert-butoxycarbonyl)amino]-3-methoxy-1-methylpyrrole-2-carboxylate (9.0 g, 30.2 mmol)
was dissolved in 30 mL ethanol. NaOH (30 ml, 1 M, aq) was added and the solution stirred for
4 days. Water (200 ml) was added and ethanol removed in vacuo. The solution was extracted
with diethyl ether, the aqueous layer acidified to pH = 2-3, and extracted again with diethyl ether.
Organic layers were dried over MgSO4, and solvent removed in vacuo to give a white solid (6.0
g, 20.5 mmol, 87% based on recovered SM). 1H NMR (DMSO-d6) δ 12.14 (s, 1H), 8.37 (s,
1H), 6.98 (s, 1H), 3.69 (s, 3H), 3.66 (s, 3H), 1.42 (s, 9H); MS m/e 293.112 (M+H 293.104
calcd. for C12H18N2O5).

EXAMPLE 2:

SOLID PHASE SYNTHESIS OF 3-HYDROXYPYRROLE POLYAMIDES.

Cycling protocols were optimized to afford high stepwise coupling yields (>99%).
Deprotection by aminolysis affords up to 100 mg quantities of polyamide. Solid phase
polyamide synthesis protocols were modified from the in situ neutralization Boc-chemistry
protocols. Schnolzer, M., Alewood, P., Jones, A., Alewood, D. & Kent, S.B.H. report rapid in situ
neutralization for solid phase peptide synthesis. Int. J. Peptide Protein Res. 1992, 40, 180.
Coupling cycles are rapid, 72 min per residue for manual synthesis or 180 min per residue for
machine-assisted synthesis, and require no special precautions beyond those used for ordinary
solid phase peptide synthesis. Manual solid phase synthesis of a pyrrole-imidazole polyamide
consists of a dichloromethane (DCM) wash, removal of the Boc group with trifluoroacetic acid
(TFA)/DCM/thiophenol (PhSH), a DCM wash, a DMF wash, taking a resin sample for analysis,
addition of activated monomer, addition of DIEA if necessary, coupling for 45 min, taking a
resin sample for analysis, and a final DMF wash (Figure 5, Table I). In addition, the manual
solid phase protocol for synthesis of pyrrole-imidazole polyamides has been adapted for use on
an ABI 430A peptide synthesizer. The aromatic amines of the pyrrole and imidazole do not react
in the quantitative ninhydrin test. Stepwise cleavage of a sample of resin and analysis by HPLC
indicates that high stepwise yields (>99%) are routinely achieved.

Dicyclohexylcarbodiimide (DCC), hydroxybenzotriazole (HOBt),
2-(1H-benzotriazole-1-yl)-1,1,3,3-tetramethyluronium hexafluorophosphate (HBTU) and 0.2 mmol/gram
Boc-β-alanine-(4-carboxamidomethyl)-benzyl-ester-copoly(styrene-divinylbenzene) resin
(Boc-β-Pam-Resin) were purchased from Peptides International (0.2 mmol/gram), NovaBiochem (0.6
mmol/gram), or Peninsula (0.6 mmol/gram). (R)-2-Fmoc-4-Boc-diaminobutyric acid,
(S)-2-Fmoc-4-Boc-diaminobutyric acid, and (R)-2-amino-4-Boc-diaminobutyric acid were purchased
from Bachem. N,N-diisopropylethylamine (DIEA), N,N-dimethylformamide (DMF),
N-methylpyrrolidone (NMP), DMSO/NMP, acetic anhydride (Ac2O), and 0.0002 M potassium
cyanide/pyridine were purchased from Applied Biosystems. Dichloromethane (DCM) and
triethylamine (TEA) were reagent grade from EM; thiophenol (PhSH),
dimethylaminopropylamine (Dp), sodium hydride,
(R)-α-methoxy-α-(trifluoromethyl)phenylacetic acid ((R)MPTA) and
(S)-α-methoxy-α-(trifluoromethyl)phenylacetic acid ((S)MPTA) were from Aldrich; trifluoroacetic acid (TFA)
Biograde was from Halocarbon, phenol from Fisher, and ninhydrin from Pierce. All reagents were
used without further purification.

Quik-Sep polypropylene disposable filters were purchased from Isolab Inc. 1H NMR
spectra were recorded on a General Electric-QE NMR spectrometer at 300 MHz with chemical
shifts reported in parts per million relative to residual solvent. UV spectra were measured in
water on a Hewlett-Packard Model 8452A diode array spectrophotometer. Optical rotations
were recorded on a JASCO DIP 1000 Digital Polarimeter. Matrix-assisted, laser
desorption/ionization time of flight mass spectrometry (MALDI-TOF) was performed at the
Protein and Peptide Microanalytical Facility at the California Institute of Technology. HPLC
analysis was performed on either a HP 1090M analytical HPLC or a Beckman Gold system
using a Rainin C18, Microsorb MV, 5 µm, 300 x 4.6 mm reversed phase column in 0.1%
(wt/v) TFA with acetonitrile as eluent and a flow rate of 1.0 mL/min, gradient elution 1.25%
acetonitrile/min. Preparatory reverse phase HPLC was performed on a Beckman HPLC with a
Waters DeltaPak 25 x 100 mm, 100 µm C18 column equipped with a guard, 0.1% (wt/v) TFA,
0.25% acetonitrile/min. 18 MΩ water was obtained from a Millipore MilliQ water purification
system, and all buffers were 0.2 µm filtered.

Activation of Boc-3-methoxypyrrole acid. The amino acid (0.5 mmol) was dissolved in 2 mL
DMF. HBTU (190 mg, 0.5 mmol) was added followed by DIEA (1 mL) and the resulting
mixture was shaken for 5 min.

Activation of Imidazole-2-carboxylic acid, γ-aminobutyric acid, Boc-glycine, and
Boc-β-alanine. The appropriate amino acid or acid (2 mmol) was dissolved in 2 mL DMF. HBTU
(720 mg, 1.9 mmol) was added followed by DIEA (1 mL) and the solution shaken for at least 5
min.

Activation of Boc-Imidazole acid. Boc imidazole acid (257 mg, 1 mmol) and HOBt (135 mg,
1 mmol) were dissolved in 2 mL DMF, DCC (202 mg, 1 mmol) was then added and the solution
allowed to stand for at least 5 min.

Acetylation Mix. 2 mL DMF, DIEA (710 µL, 4.0 mmol), and acetic anhydride (380 µL, 4.0
mmol) were combined immediately before use.

Manual Synthesis Protocol. Boc-β-alanine-Pam-Resin (1.25 g, 0.25 mmol) was placed in a 20
mL glass reaction vessel, shaken in DMF for 5 min and the reaction vessel drained. The resin
was washed with DCM (2 x 30 s) and the Boc group removed with 80% TFA/DCM/0.5 M
PhSH, 1 x 30 s, 1 x 20 min. The resin was washed with DCM (2 x 30 s) followed by DMF (1 x
30 s). A resin sample (5-10 mg) was taken for analysis. The vessel was drained completely and
activated monomer added, followed by DIEA if necessary. The reaction vessel was shaken
vigorously to make a slurry. The coupling was allowed to proceed for 90 min, and a resin
sample taken. Acetic anhydride (1 mL) was added and the reaction shaken for 5 min. The
reaction vessel was then washed with DMF, followed by DCM.

Machine-Assisted Protocols. Machine-assisted synthesis was performed on an ABI 430A
synthesizer on a 0.18 mmol scale (900 mg resin; 0.2 mmol/gram). Each cycle of amino acid
addition involved: deprotection with approximately 80% TFA/DCM/0.4 M PhSH for 3 minutes,
draining the reaction vessel, and then deprotection for 17 minutes; 2 dichloromethane flow
washes; an NMP flow wash; draining the reaction vessel; coupling for 1 hour with in situ
neutralization, addition of dimethyl sulfoxide (DMSO)/NMP, coupling for 30 minutes, addition
of DIEA, coupling for 30 minutes; draining the reaction vessel; washing with DCM, taking a
resin sample for evaluation of the progress of the synthesis by HPLC analysis; capping with
acetic anhydride/DIEA in DCM for 6 minutes; and washing with DCM. A double couple cycle
is employed when coupling aliphatic amino acids to imidazole; all other couplings are
performed with single couple cycles.
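The synthesis scales quoted for the manual and machine-assisted protocols follow directly from the resin mass and its substitution level. A minimal sketch of that arithmetic is given below; the function name is illustrative and not part of the described protocols.

```python
def synthesis_scale_mmol(resin_mg: float, loading_mmol_per_g: float) -> float:
    """Synthesis scale (mmol) from resin mass (mg) and resin loading (mmol/g)."""
    return resin_mg / 1000.0 * loading_mmol_per_g

# Manual protocol: 1.25 g of resin at 0.2 mmol/gram.
print(synthesis_scale_mmol(1250, 0.2))
# Machine-assisted protocol: 900 mg of resin at 0.2 mmol/gram.
print(synthesis_scale_mmol(900, 0.2))
```

These correspond to the 0.25 mmol and 0.18 mmol scales stated in the two protocols.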

The ABI 430A synthesizer was left in the standard hardware configuration for
NMP-HOBt protocols. Reagent positions 1 and 7 were DIEA, reagent position 2 was TFA/0.5 M
thiophenol, reagent position 3 was 70% ethanolamine/methanol, reagent position 4 was acetic
anhydride, reagent position 5 was DMSO/NMP, reagent position 6 was methanol, and reagent
position 8 was DMF. New activator functions were written, one for direct transfer of the
cartridge contents to the concentrator (switch list 21, 25, 26, 35, 37, 44), and a second for
transfer of reagent position 8 directly to the cartridge (switch list 37, 39, 45, 46).

Boc-Py-OBt ester (357 mg, 1 mmol) was dissolved in 2 ml DMF and filtered into a
synthesis cartridge. Boc-Im acid monomer was activated (DCC/HOBt), filtered, and placed in a
synthesis cartridge. Imidazole-2-carboxylic acid was added manually. At the initiation of the
coupling cycle the synthesis was interrupted, the reaction vessel vented and the activated
monomer added directly to the reaction vessel through the resin sampling loop via syringe.
When manual addition was necessary an empty synthesis cartridge was used. Aliphatic amino
acids (2 mmol) and HBTU (1.9 mmol) were placed in a synthesis cartridge. 3 ml of DMF was
added using a calibrated delivery loop from reagent bottle 8, followed by calibrated delivery of
1 ml DIEA from reagent bottle 7, and a 3 minute mixing of the cartridge.

The activator cycle was written to transfer activated monomer directly from the cartridge to
the concentrator vessel, bypassing the activator vessel. After transfer, 1 ml of DIEA was
measured into the cartridge using a calibrated delivery loop, and the DIEA solution combined
with the activated monomer solution in the concentrator vessel. The activated ester in 2:1
DMF/DIEA was then transferred to the reaction vessel. All lines were emptied with argon
before and after solution transfers.


ImImOpPy-γ-ImPyPyPy-β-Dp. ImImOpPy-γ-ImPyPyPy-β-Pam-Resin was synthesized
in a stepwise fashion by machine-assisted solid phase methods from Boc-β-Pam-Resin (0.66
mmol/g). Baird, E. E. & Dervan, P. B. describe the solid phase synthesis of polyamides
containing imidazole and pyrrole amino acids. J. Am. Chem. Soc. 118, 6141-6146 (1996); also
see PCT US 97/003332. 3-hydroxypyrrole-Boc-amino acid (0.7 mmol) was incorporated by
placing the amino acid (0.5 mmol) and HBTU (0.5 mmol) in a machine synthesis cartridge.
Upon automated delivery of DMF (2 mL) and DIEA (1 mL) activation occurs. A sample of
ImImOpPy-γ-ImPyPyPy-β-Pam-Resin (400 mg, 0.40 mmol/gram) was placed in a glass 20 mL
peptide synthesis vessel and treated with neat dimethylaminopropylamine (2 mL) and heated
(55 °C) with periodic agitation for 16 h. The reaction mixture was then filtered to remove resin,
0.1% (wt/v) TFA added (6 mL) and the resulting solution purified by reversed phase HPLC.
ImImOpPy-γ-ImPyPyPy-β-Dp is recovered upon lyophilization of the appropriate fractions as a
white powder (97 mg, 49% recovery). UV (H2O) λmax 246, 316 (66,000); 1H NMR (DMSO-d6)
δ 10.24 (s, 1 H), 10.14 (s, 1 H), 9.99 (s, 1 H), 9.94 (s, 1 H), 9.88 (s, 1 H), 9.4 (br s, 1 H), 9.25 (s,
1 H), 9.11 (s, 1 H), 8.05 (m, 3 H), 7.60 (s, 1 H), 7.46 (s, 1 H), 7.41 (s, 1 H), 7.23 (d, 1 H), 7.21 (d,
1 H), 7.19 (d, 1 H), 7.13 (m, 2 H), 7.11 (m, 2 H), 7.02 (d, 1 H), 6.83 (m, 2 H), 3.96 (s, 6 H),
3.90 (s, 3 H), 3.81 (m, 6 H), 3.79 (s, 3 H), 3.75 (d, 9 H), 3.33 (q, 2 H, J = 5.4 Hz), 3.15 (q, 2 H,
J = 5.5 Hz), 3.08 (q, 2 H, J = 6.0 Hz), 2.96 (quintet, 2 H, J = 5.6 Hz), 2.70 (d, 6 H, J = 4.5 Hz),
2.32 (m, 4 H), 1.71 (m, 4 H); MALDI-TOF-MS (monoisotopic), 1253.5 (1253.6 calc. for
C58H72N22O11).

ImImHpPy-γ-ImPyPyPy-β-Dp. In order to remove the methoxy protecting group, a sample of
ImImOpPy-γ-ImPyPyPy-β-Dp (5 mg, 3.9 µmol) was treated with sodium thiophenoxide at 100
°C for 2 h. DMF (1000 µL) and thiophenol (500 µL) were placed in a (13 x 100 mm) disposable
Pyrex screw cap culture tube. A 60% dispersion of sodium hydride in mineral oil (100 mg) was
slowly added. Upon completion of the addition of the sodium hydride,
ImImOpPy-γ-ImPyPyPy-β-Dp (5 mg) dissolved in DMF (500 µL) was added. The solution was agitated, placed in a
100 °C heat block, and deprotected for 2 h. Upon completion of the reaction the culture tube
was cooled to 0 °C, and 7 ml of a 20% (wt/v) solution of trifluoroacetic acid added. The
aqueous layer is separated from the resulting biphasic solution and purified by reversed phase
HPLC. ImImHpPy-γ-ImPyPyPy-β-Dp is recovered as a white powder upon lyophilization of
the appropriate fractions (3.8 mg, 77% recovery). UV (H2O) λmax 246, 312 (66,000); 1H NMR
(DMSO-d6) δ 10.34 (s, 1 H), 10.24 (s, 1 H), 10.00 (s, 2 H), 9.93 (s, 1 H), 9.87 (s, 1 H), 9.83 (s,
1 H), 9.4 (br s, 1 H), 9.04 (s, 1 H), 8.03 (m, 3 H), 7.58 (s, 1 H), 7.44 (s, 1 H), 7.42 (s, 1 H), 7.23
(s, 1 H), 7.20 (m, 3 H), 7.12 (m, 2 H), 7.05 (d, 1 H), 7.02 (d, 1 H), 6.83 (s, 1 H), 6.79 (s, 1 H),
3.96 (s, 6 H), 3.90 (s, 3 H), 3.81 (s, 6 H), 3.79 (s, 3 H), 3.75 (d, 6 H), 3.33 (q, 2 H, J = 5.4 Hz),
3.14 (q, 2 H, J = 5.4 Hz), 3.08 (q, 2 H, J = 6.1 Hz), 2.99 (quintet, 2 H, J = 5.4 Hz), 2.69 (d, 6 H,
J = 4.2 Hz), 2.31 (m, 4 H), 1.72 (m, 4 H); MALDI-TOF-MS (monoisotopic), 1239.6 (1239.6
calc. for C57H71N22O11).

ImImPyPy-γ-ImOpPyPy-β-Dp. ImImPyPy-γ-ImOpPyPy-β-Pam-Resin was synthesized
in a stepwise fashion by machine-assisted solid phase methods from Boc-β-Pam-Resin (0.66
mmol/g) as described for ImImOpPy-γ-ImPyPyPy-β-Dp. A sample of ImImPyPy-γ-ImOpPyPy-β-Pam-Resin
(400 mg, 0.40 mmol/gram) was placed in a glass 20 mL peptide synthesis vessel
and treated with neat dimethylaminopropylamine (2 mL) and heated (55 °C) with periodic
agitation for 16 h. The reaction mixture was then filtered to remove resin, 0.1% (wt/v) TFA
added (6 mL) and the resulting solution purified by reversed phase HPLC.
ImImPyPy-γ-ImOpPyPy-β-Dp is recovered upon lyophilization of the appropriate fractions as a white
powder (101 mg, 50% recovery). UV (H2O) λmax 246, 316 (66,000); MALDI-TOF-MS
(monoisotopic), 1253.6 (1253.6 calc. for C58H72N22O11).

ImImPyPy-γ-ImHpPyPy. A sample of ImImPyPy-γ-ImOpPyPy-β-Dp (5 mg, 3.9 µmol)
was treated with sodium thiophenoxide and purified by reversed phase HPLC as described for
ImImHpPy-γ-ImPyPyPy-β-Dp. ImImPyPy-γ-ImHpPyPy-β-Dp is recovered upon lyophilization
of the appropriate fractions as a white powder (3.2 mg, 66% recovery). UV (H2O) λmax 246,
312 (66,000); MALDI-TOF-MS (monoisotopic), 1239.6 (1239.6 calc. for C57H71N22O11).
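
The recovery figures quoted in these procedures can be checked with simple arithmetic. The sketch below is illustrative only: it assumes recovery is computed on a molar basis and approximates each molar mass by the reported monoisotopic (M+H) value, neither of which the text states. The function name `percent_recovery` is hypothetical.

```python
def percent_recovery(product_mg, product_mw, start_mmol):
    """Molar recovery (%) of product relative to the starting amount."""
    product_mmol = product_mg / product_mw
    return 100.0 * product_mmol / start_mmol

# Cleavage from resin: 400 mg of resin at 0.40 mmol/g, 101 mg of product recovered.
resin_mmol = 0.400 * 0.40  # grams of resin x loading (mmol/g)
cleavage = percent_recovery(101.0, 1253.6, resin_mmol)

# Deprotection: 5 mg of the Op polyamide gave 3.8 mg of the Hp polyamide.
start_mmol = 5.0 / 1253.6
deprotection = percent_recovery(3.8, 1239.6, start_mmol)

print(f"cleavage: {cleavage:.0f}%  deprotection: {deprotection:.0f}%")
```

Under these assumptions the two values round to the 50% and 77% recoveries reported above for ImImPyPy-γ-ImOpPyPy-β-Dp and ImImHpPy-γ-ImPyPyPy-β-Dp. Some other recoveries in this document differ from this estimate by a few percent, so the calculation should be read as a plausibility check rather than the exact method used.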

ImPyPy-γ-OpPyPy-β-Dp. ImPyPy-γ-OpPyPy-β-Pam-Resin was synthesized in a
stepwise fashion by machine-assisted solid phase methods from Boc-β-Pam-Resin (0.66
mmol/g). Baird, E. E. & Dervan, P. B. describes the solid phase synthesis of polyamides
containing imidazole and pyrrole amino acids. J. Am. Chem. Soc. 118, 6141-6146 (1996); also
see PCT US 97/003332. 3-hydroxypyrrole-Boc-amino acid (0.7 mmol) was incorporated by
placing the amino acid (0.5 mmol) and HBTU (0.5 mmol) in a machine synthesis cartridge.
Upon automated delivery of DMF (2 mL) and DIEA (1 mL) activation occurs. A sample of
ImPyPy-γ-OpPyPy-β-Pam-Resin (400 mg, 0.45 mmol/gram) was placed in a glass 20 mL
peptide synthesis vessel and treated with neat dimethylaminopropylamine (2 mL) and heated
(55 °C) with periodic agitation for 16 h. The reaction mixture was then filtered to remove resin,
0.1% (wt/v) TFA added (6 mL) and the resulting solution purified by reversed phase HPLC.
ImPyPy-γ-OpPyPy-β-Dp is recovered upon lyophilization of the appropriate fractions as a
white powder (45 mg, 25% recovery). UV (H2O) λmax 246, 310 (50,000); 1H NMR (DMSO-d6)
δ 10.45 (s, 1 H), 9.90 (s, 1 H), 9.82 (s, 1 H), 9.5 (br s, 1 H), 9.38 (s, 1 H), 9.04 (s, 1 H), 8.02 (m,
3 H), 7.37 (s, 1 H), 7.25 (m, 2 H), 7.15 (d, 1 H, J = 1.6 Hz), 7.11 (m, 2 H), 7.09 (d, 1 H), 7.03
(d, 1 H), 6.99 (d, 1 H), 6.87 (d, 1 H), 6.84 (d, 1 H), 3.96 (s, 3 H), 3.81 (s, 6 H), 3.77 (s, 6 H),
3.76 (s, 3 H), 3.74 (s, 1 H), 3.34 (q, 2 H, J = 5.6 Hz), 3.20 (q, 2 H, J = 5.8 Hz), 3.09 (q, 2 H, J =
6.1 Hz), 2.97 (quintet, 2 H, J = 5.3 Hz), 2.70 (d, 6 H, J = 3.9 Hz), 2.34 (m, 4 H), 1.73 (m, 4 H);
MALDI-TOF-MS (monoisotopic), 1007.6 (1007.5 calc. for C48H63N16O9).

ImPyPy-γ-HpPyPy. In order to remove the methoxy protecting group, a sample of
ImPyPy-γ-OpPyPy-β-Dp (5 mg, 4.8 µmol) was treated with sodium thiophenoxide at 100 °C
for 2 h. DMF (1000 µL) and thiophenol (500 µL) were placed in a (13 x 100 mm) disposable
Pyrex screw cap culture tube. A 60% dispersion of sodium hydride in mineral oil (100 mg) was
slowly added. Upon completion of the addition of the sodium hydride, ImPyPy-γ-OpPyPy-β-Dp
(5 mg) dissolved in DMF (500 µL) was added. The solution was agitated, placed in a
100 °C heat block, and deprotected for 2 h. Upon completion of the reaction the culture tube
was cooled to 0 °C, and 7 mL of a 20% (wt/v) solution of trifluoroacetic acid was added. The
aqueous layer is separated from the resulting biphasic solution and purified by reversed phase
HPLC. ImPyPy-γ-HpPyPy-β-Dp is recovered as a white powder upon lyophilization of
the appropriate fractions (2.5 mg, 52% recovery). UV (H2O) λmax 246, 310 (50,000); 1H NMR
(DMSO-d6) δ 10.44 (s, 1 H), 10.16 (s, 1 H), 9.90 (s, 1 H), 9.77 (s, 1 H), 9.5 (br s, 1 H), 9.00 (s,
1 H), 8.03 (m, 3 H), 7.37 (s, 1 H), 7.26 (m, 2 H), 7.14 (d, 1 H, J = 1.7 Hz), 7.12 (m, 2 H), 7.02
(d, 1 H), 6.93 (d, 1 H), 6.88 (d, 1 H), 6.82 (d, 1 H), 6.72 (d, 1 H), 3.96 (s, 3 H), 3.81 (s, 6 H),
3.77 (s, 3 H), 3.76 (s, 3 H), 3.74 (s, 1 H), 3.36 (q, 2 H, J = 5.4 Hz), 3.22 (q, 2 H, J = 5.9 Hz),
3.09 (q, 2 H, J = 5.5 Hz), 2.98 (quintet, 2 H, J = 5.3 Hz), 2.70 (d, 6 H, J = 4.3 Hz), 2.34 (m, 4
H), 1.78 (m, 4 H); MALDI-TOF-MS (monoisotopic), 994.2 (993.5 calc. for C47H61N16O9).

Table 5. Mass spectral characterization of Op and Hp polyamides, synthesized and purified as
described for ImImOpPy-γ-ImPyPyPy-β-Dp and ImImHpPy-γ-ImPyPyPy-β-Dp.

POLYAMIDE                            FORMULA (M+H)     CALCD     FOUND

ImOpPy-γ-PyPyPy-β-Dp                 C48H63N16O9       1007.5    1007.5
ImHpPy-γ-PyPyPy-β-Dp                 C47H61N16O9        993.5     993.2
ImPyOp-γ-PyPyPy-β-Dp                 C48H63N16O9       1007.5    1007.5
ImPyHp-γ-PyPyPy-β-Dp                 C47H61N16O9        993.5     993.4
ImPyPy-γ-OpPyPy-β-Dp                 C48H63N16O9       1007.5    1007.6
ImPyPy-γ-HpPyPy-β-Dp                 C47H61N16O9        993.5     993.2
ImPyPy-γ-PyOpPy-β-Dp                 C48H63N16O9       1007.5    1007.5
ImPyPy-γ-PyHpPy-β-Dp                 C47H61N16O9        993.5     993.4
ImOpOp-γ-PyPyPy-β-Dp                 C49H65N16O10      1037.5    1037.5
ImHpHp-γ-PyPyPy-β-Dp                 [illegible]       1009.5    1009.4
ImImOpPy-γ-ImPyPyPy-β-Dp             C58H72N22O11      1253.6    1253.5
ImImHpPy-γ-ImPyPyPy-β-Dp             C57H71N22O11      1239.6    1239.6
ImImPyPy-γ-ImOpPyPy-β-Dp             C58H72N22O11      1253.6    1253.6
ImImPyPy-γ-ImHpPyPy-β-Dp             C57H71N22O11      1239.6    1239.6
ImOpPyPy-γ-ImOpPyPy-β-Dp             C60H76N21O12      1282.6    1282.6
ImHpPyPy-γ-ImHpPyPy-β-Dp             C58H72N21O12      1254.6    1254.6
ImImOpPy-γ-ImOpPyPy-β-Dp             C59H75N22O12      1283.6    1283.6
ImImHpPy-γ-ImHpPyPy-β-Dp             C57H71N22O12      1255.6    1255.5
ImOpPyPy-γ-PyPyPyPy-β-Dp             C60H75N20O11      1251.6    1251.5
ImPyPyPy-γ-PyPyOpPy-β-Dp             C60H75N20O11      1251.6    1251.5
ImImPyPy-γ-ImPyOpPy-β-Dp             C58H72N22O11      1253.6    1253.7
ImOpPyPyPy-γ-ImOpPyPyPy-β-Dp         C72H88N25O14      1526.7    1526.6
ImHpPyPyPy-γ-ImHpPyPyPy-β-Dp         C70H84N25O14      1498.7    1498.0
ImImPyPyPy-γ-ImOpOpPyPy-β-Dp         C71H87N26O14      1527.7    1527.7

EXAMPLE 3:
DETERMINATION OF POLYAMIDE
BINDING AFFINITY AND SEQUENCE SPECIFICITY.



Representative footprint titration experiments are shown in Figures 3 and 10. A 252-bp
DNA fragment which is typically used for the footprint titration experiments provides 247
possible 6-bp binding sites for an eight-ring hairpin polyamide. Thus, in addition to providing
DNA binding affinities, the footprint titration experiments also reveal DNA binding
sequence-specificity. The DNA binding sequence specificity of polyamides which differ by a single
Py/Py, Hp/Py, or Py/Hp pair for sites which differ by a single A•T or T•A base pair is
described in Tables 1, 6, and 7.
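
The figure of 247 possible binding sites follows from counting overlapping 6-bp windows on a linear 252-bp fragment. A minimal sketch of that count is given below; the function name `count_sites` is hypothetical and the calculation ignores any end effects of the labeled fragment.

```python
def count_sites(fragment_bp, site_bp):
    """Number of overlapping windows of site_bp base pairs on a linear fragment."""
    if site_bp > fragment_bp:
        return 0
    return fragment_bp - site_bp + 1

# 6-bp sites available to an eight-ring hairpin on the 252-bp fragment
print(count_sites(252, 6))
```

The same count for 7-bp windows gives 246, which is one match site plus the 245 mismatch sites cited for the ten-ring hairpins in Example 7.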



Quantitative DNase I Footprint Titrations. All reactions were executed in a total volume
of 400 µL (Brenowitz, M. et al., 1986). A polyamide stock solution or H2O (for reference
lanes) was added to an assay buffer containing 3'-32P radiolabeled restriction fragment (20,000
cpm), affording final solution conditions of 10 mM Tris•HCl, 10 mM KCl, 10 mM MgCl2, 5
mM CaCl2, pH 7.0, and either (i) a suitable concentration range of polyamide, or (ii) no
polyamide (for reference lanes). The solutions were allowed to equilibrate for 24 hours at 22 °C.
Footprinting reactions were initiated by the addition of 10 µL of a stock solution of DNase I (at
the appropriate concentration to give ~55% intact DNA) containing 1 mM dithiothreitol and
allowed to proceed for 7 minutes at 22 °C. The reactions were stopped by the addition of 50 µL
of a solution containing 2.25 M NaCl, 150 mM EDTA, 23 µM base pair calf thymus DNA, and
0.6 mg/ml glycogen, and ethanol precipitated. The reactions were resuspended in 1x TBE/80%
formamide loading buffer, denatured by heating at 85 °C for 15 minutes, and placed on ice.
The reaction products were separated by electrophoresis on an 8% polyacrylamide gel (5%
crosslinking, 7 M urea) in 1x TBE at 2000 V for 1.5 h. Gels were dried on a slab dryer and
then exposed to a storage phosphor screen at 22 °C.

Photostimulable storage phosphor imaging plates (Kodak Storage Phosphor Screen
SO230 obtained from Molecular Dynamics) were pressed flat against dried gel samples and
exposed in the dark at 22 °C for 12-24 hours. A Molecular Dynamics 400S PhosphorImager
was used to obtain all data from the storage screens (Johnston et al., 1990). The data were
analyzed by performing volume integration of the target site and reference blocks using the
ImageQuant v. 3.3 software running on a Compaq Pentium 80.

Quantitative DNase I Footprint Titration Data Analysis. Background-corrected volume
integration of rectangles encompassing the footprint sites and a reference site at which
DNase I reactivity was invariant across the titration generated values for
the site intensities (I_site) and the reference intensity (I_ref). The apparent fractional occupancy
(θ_app) of the sites was calculated using the equation:

$$\theta_{app} = 1 - \frac{I_{site}/I_{ref}}{I_{site}^{\circ}/I_{ref}^{\circ}} \qquad (1)$$

where I_site° and I_ref° are the site and reference intensities, respectively, from a DNase I control
lane to which no polyamide was added.
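
As an illustration of eq. 1, the following sketch computes the apparent fractional occupancy from integrated band volumes. The intensity values are hypothetical (arbitrary units) and are not data from this document; the function name `theta_app` is likewise an illustrative choice.

```python
def theta_app(i_site, i_ref, i_site0, i_ref0):
    """Apparent fractional occupancy from eq. 1.

    i_site, i_ref   : site and reference intensities in a polyamide lane
    i_site0, i_ref0 : the same quantities in the control lane (no polyamide)
    """
    return 1.0 - (i_site / i_ref) / (i_site0 / i_ref0)

# Hypothetical integrated volumes, not data from the source.
control = {"site": 1200.0, "ref": 1000.0}  # lane without polyamide
lane = {"site": 300.0, "ref": 1000.0}      # lane with polyamide
print(theta_app(lane["site"], lane["ref"], control["site"], control["ref"]))
```

Dividing by the reference intensity corrects each lane for differences in loading and overall DNase I activity, so a lane identical to the control gives an occupancy of zero.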

The ([L]tot, θ_app) data were fit to a Langmuir binding isotherm (eq. 2, n = 1) by
minimizing the difference between θ_app and θ_fit, using the modified Hill equation:

$$\theta_{fit} = \theta_{min} + (\theta_{max} - \theta_{min}) \frac{K_a^{n}[L]_{tot}^{n}}{1 + K_a^{n}[L]_{tot}^{n}} \qquad (2)$$

where [L]tot is the total polyamide concentration, K_a is the equilibrium association constant, and
θ_min and θ_max are the experimentally determined site saturation values when the site is
unoccupied or saturated, respectively. The data were fit using a nonlinear least-squares fitting
procedure of KaleidaGraph software (v. 3.0.1, Abelbeck Software) with K_a, θ_max, and θ_min as the
adjustable parameters. The goodness of fit of the binding curve to the data points is evaluated
by the correlation coefficient, with R > 0.97 as the criterion for an acceptable fit. Four sets of
acceptable data were used in determining each association constant. All lanes from a gel were
used unless a visual inspection revealed a data point to be obviously flawed relative to
neighboring points. The data were normalized using the following equation:

$$\theta_{norm} = \frac{\theta_{app} - \theta_{min}}{\theta_{max} - \theta_{min}}$$

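
The fit of eq. 2 and the subsequent normalization can be sketched in plain Python. This is not the KaleidaGraph procedure named above: as a stand-in, the sketch scans log10(K_a) over a grid and, at each trial value, solves θ_min and θ_max by linear least squares (eq. 2 is linear in those two parameters once K_a is fixed). The grid limits, the function names, and the synthetic titration are all assumptions made for illustration.

```python
import math

def bound_fraction(ka, conc, n=1):
    """Hill term K_a^n [L]^n / (1 + K_a^n [L]^n) from eq. 2."""
    x = (ka * conc) ** n
    return x / (1.0 + x)

def theta_fit(conc, ka, theta_min, theta_max, n=1):
    """Eq. 2: fitted occupancy at total polyamide concentration conc (M)."""
    return theta_min + (theta_max - theta_min) * bound_fraction(ka, conc, n)

def fit_isotherm(concs, thetas, n=1, log_ka_lo=4.0, log_ka_hi=12.0, steps=801):
    """Scan log10(K_a); at each trial K_a solve theta_min and theta_max by
    linear least squares and keep the trial with the smallest squared error."""
    best = None
    for i in range(steps):
        ka = 10.0 ** (log_ka_lo + (log_ka_hi - log_ka_lo) * i / (steps - 1))
        f = [bound_fraction(ka, c, n) for c in concs]
        g = [1.0 - fi for fi in f]
        # Normal equations for theta = theta_min * g + theta_max * f
        sgg = sum(gi * gi for gi in g)
        sgf = sum(gi * fi for gi, fi in zip(g, f))
        sff = sum(fi * fi for fi in f)
        rg = sum(gi * t for gi, t in zip(g, thetas))
        rf = sum(fi * t for fi, t in zip(f, thetas))
        det = sgg * sff - sgf * sgf
        if det <= 1e-15:
            continue  # theta_min and theta_max are not separable at this K_a
        t_min = (rg * sff - rf * sgf) / det
        t_max = (rf * sgg - rg * sgf) / det
        sse = sum((t_min * gi + t_max * fi - t) ** 2
                  for gi, fi, t in zip(g, f, thetas))
        if best is None or sse < best[0]:
            best = (sse, ka, t_min, t_max)
    _, ka, t_min, t_max = best
    return ka, t_min, t_max

def normalize(thetas, theta_min, theta_max):
    """Normalized occupancy (theta_app - theta_min) / (theta_max - theta_min)."""
    return [(t - theta_min) / (theta_max - theta_min) for t in thetas]

# Synthetic, noise-free titration (hypothetical values, not data from the source).
concs = [1e-11, 3e-11, 1e-10, 3e-10, 1e-9, 3e-9, 1e-8, 3e-8, 1e-7]
thetas = [theta_fit(c, 1e9, 0.02, 0.95) for c in concs]
ka, t_min, t_max = fit_isotherm(concs, thetas)
print(f"K_a = {ka:.2e} M^-1, theta_min = {t_min:.2f}, theta_max = {t_max:.2f}")
```

With real data a dedicated nonlinear least-squares routine would normally be preferred; the grid scan is used here only because it keeps the example self-contained. The equilibrium dissociation constants reported in the tables correspond to 1/K_a.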



TABLE 6 Discrimination of 5'-TGTAA-3' and 5'-TGTTA-3'*

Pair†          5'-TGTAA-3'             5'-TGTTA-3'               K_rel‡

Py/Py          K_d = 0.014 µM          K_d = 0.00[illegible]     2.0
[illegible]    K_d = [illegible]       K_d = 0.56 µM             0.36
Hp/Py          K_d = 4.0 µM            K_d = 0.28 µM             14

[Binding-model diagrams of the polyamide-DNA complexes not reproduced.]

*The reported equilibrium dissociation constants are the mean values
obtained from two DNase I footprint titration experiments on a 3'-labeled
370-bp pDEH1 EcoRI/PvuII DNA restriction fragment (ref. 13). The
assays were carried out at 22 °C, pH 7.0 in the presence of 10 mM
Tris•HCl, 10 mM KCl, 10 mM MgCl2, and 5 mM CaCl2.
†Ring pairing opposite T•A and A•T in the third position.
‡Calculated as K_d(5'-TGTAA-3')/K_d(5'-TGTTA-3').
TABLE 7 Discrimination of 5'-TGTTT-3' and 5'-TGATT-3'*

Pair†          5'-TGATT-3'             5'-TGTTT-3'               K_rel‡

Py/Py          K_d = 0.026 µM          K_d = 0.005 µM            5.2
Hp/Py          K_d = 0.53 µM           K_d = 0.008 µM            66
Py/Hp          [illegible]             [illegible]               0.56 [column uncertain]

[Binding-model diagrams of the polyamide-DNA complexes not reproduced.]

*The reported equilibrium dissociation constants are the mean values
obtained from two DNase I footprint titration experiments. The assays
were carried out at 22 °C, pH 7.0 in the presence of 10 mM Tris•HCl,
10 mM KCl, 10 mM MgCl2, and 5 mM CaCl2.
†Ring pairing opposite T•A and A•T in the third position.
‡Calculated as K_d(5'-TGATT-3')/K_d(5'-TGTTT-3').



EXAMPLE 5: 

PREPARATION OF A BIFUNCTIONAL Hp-Py-Im-POLYAMIDE.

ImImOpPy-γ-ImPyPyPy-β-Dp-NH2. ImImOpPy-γ-ImPyPyPy-β-Pam-Resin was
synthesized in a stepwise fashion by machine-assisted solid phase methods from Boc-β-Pam-Resin
(0.66 mmol/g). Baird, E. E. & Dervan, P. B. describes the solid phase synthesis of
polyamides containing imidazole and pyrrole amino acids. J. Am. Chem. Soc. 118, 6141-6146
(1996); also see PCT US 97/003332. 3-hydroxypyrrole-Boc-amino acid (0.7 mmol) was
incorporated by placing the amino acid (0.5 mmol) and HBTU (0.5 mmol) in a machine
synthesis cartridge. Upon automated delivery of DMF (2 mL) and DIEA (1 mL) activation
occurs. A sample of ImImOpPy-γ-ImPyPyPy-β-Pam-Resin (400 mg, 0.40 mmol/gram) was
placed in a glass 20 mL peptide synthesis vessel and treated with neat
3,3'-diamino-N-methyldipropylamine (2 mL) and heated (55 °C) with periodic agitation for 16 h. The reaction
mixture was then filtered to remove resin, 0.1% (wt/v) TFA added (6 mL) and the resulting
solution purified by reversed phase HPLC. ImImOpPy-γ-ImPyPyPy-β-Dp-NH2 is recovered
upon lyophilization of the appropriate fractions as a white powder (93 mg, 46% recovery). UV
(H2O) λmax 246, 316 (66,000); 1H NMR (DMSO-d6) δ 10.34 (s, 1 H), 10.30 (br s, 1 H), 10.25
(s, 1 H), 9.96 (s, 1 H), 9.95 (s, 1 H), 9.89 (s, 1 H), 9.24 (s, 1 H), 9.11 (s, 1 H), 8.08 (t, 1 H, J =
5.6 Hz), 8.0 (m, 5 H), 7.62 (s, 1 H), 7.53 (s, 1 H), 7.42 (s, 1 H), 7.23 (d, 1 H, J = 1.2 Hz), 7.21
(m, 2 H), 7.15 (m, 2 H), 7.13 (d, 1 H), 7.11 (m, 2 H), 7.04 (d, 1 H), 6.84 (m, 3 H), 3.98 (s, 3 H),
3.97 (s, 3 H), 3.92 (s, 3 H), 3.82 (m, 6 H), 3.80 (s, 3 H), 3.77 (d, 6 H), 3.35 (q, 2 H, J = 5.8 Hz),
3.0-3.3 (m, 8 H), 2.86 (q, 2 H, J = 5.4 Hz), 2.66 (d, 3 H, J = 4.5 Hz), 2.31 (m, 4 H), 1.94
(quintet, 2 H, J = 6.2 Hz), 1.74 (m, 4 H); MALDI-TOF-MS (monoisotopic), 1296.0 (1296.6
calc. for C60H78N23O11).

ImImOpPy-γ-ImPyPyPy-β-Dp-EDTA. Excess EDTA-dianhydride (50 mg) was dissolved
in DMSO/NMP (1 mL) and DIEA (1 mL) by heating at 55 °C for 5 min. The dianhydride
solution was added to ImImOpPy-γ-ImPyPyPy-β-NH2 (13 mg, 10 µmol) dissolved in DMSO
(750 µL). The mixture was heated (55 °C, 25 min.) and the remaining EDTA-anhydride
hydrolyzed (0.1 M NaOH, 3 mL, 55 °C, 10 min). Aqueous TFA (0.1% wt/v) was added to
adjust the total volume to 8 mL and the solution purified directly by reversed phase HPLC to
provide ImImOpPy-γ-ImPyPyPy-β-Dp-EDTA as a white powder upon lyophilization of the
appropriate fractions (5.5 mg, 40% recovery). MALDI-TOF-MS (monoisotopic), 1570.9
(1570.7 calc. for C70H92N25O18).



ImImHpPy-γ-ImPyPyPy-β-Dp-EDTA. In order to remove the methoxy protecting group,
a sample of ImImOpPy-γ-ImPyPyPy-β-Dp-EDTA (5 mg, 3.1 µmol) was treated with sodium
thiophenoxide at 100 °C for 2 h. DMF (1000 µL) and thiophenol (500 µL) were placed in a (13
x 100 mm) disposable Pyrex screw cap culture tube. A 60% dispersion of sodium hydride in
mineral oil (100 mg) was slowly added. Upon completion of the addition of the sodium hydride,
ImImOpPy-γ-ImPyPyPy-β-Dp-EDTA (5 mg) dissolved in DMF (500 µL) was added. The
solution was agitated, placed in a 100 °C heat block, and deprotected for 2 h. Upon
completion of the reaction the culture tube was cooled to 0 °C, and 7 mL of a 20% (wt/v)
solution of trifluoroacetic acid was added. The aqueous layer is separated from the resulting biphasic
solution and purified by reversed phase HPLC. ImImHpPy-γ-ImPyPyPy-β-Dp-EDTA is
recovered as a white powder upon lyophilization of the appropriate fractions (3.2 mg, 72%
recovery). UV (H2O) λmax 246, 312 (66,000); MALDI-TOF-MS (monoisotopic), 1556.6
(1556.7 calc. for C69H90N25O18).



ImImPyPy-γ-ImOpPyPy-β-Dp-NH2. ImImPyPy-γ-ImOpPyPy-β-Pam-Resin was
synthesized in a stepwise fashion by machine-assisted solid phase methods from Boc-β-Pam-Resin
(0.66 mmol/g). Baird, E. E. & Dervan, P. B. describes the solid phase synthesis of
polyamides containing imidazole and pyrrole amino acids. J. Am. Chem. Soc. 118, 6141-6146
(1996); also see PCT US 97/003332. 3-hydroxypyrrole-Boc-amino acid (0.7 mmol) was
incorporated by placing the amino acid (0.5 mmol) and HBTU (0.5 mmol) in a machine
synthesis cartridge. Upon automated delivery of DMF (2 mL) and DIEA (1 mL) activation
occurs. A sample of ImImPyPy-γ-ImOpPyPy-β-Pam-Resin (400 mg, 0.40 mmol/gram) was
placed in a glass 20 mL peptide synthesis vessel and treated with neat
3,3'-diamino-N-methyldipropylamine (2 mL) and heated (55 °C) with periodic agitation for 16 h. The reaction
mixture was then filtered to remove resin, 0.1% (wt/v) TFA added (6 mL) and the resulting
solution purified by reversed phase HPLC. ImImPyPy-γ-ImOpPyPy-β-Dp-NH2 is recovered
upon lyophilization of the appropriate fractions as a white powder (104 mg, 54% recovery). UV
(H2O) λmax 246, 316 (66,000); MALDI-TOF-MS (monoisotopic), 1296.6 (1296.6 calc. for
C60H78N23O11).

ImImPyPy-γ-ImOpPyPy-β-Dp-EDTA. Excess EDTA-dianhydride (50 mg) was dissolved
in DMSO/NMP (1 mL) and DIEA (1 mL) by heating at 55 °C for 5 min. The dianhydride
solution was added to ImImPyPy-γ-ImOpPyPy-β-NH2 (13 mg, 10 µmol) dissolved in DMSO
(750 µL). The mixture was heated (55 °C, 25 min.) and the remaining EDTA-anhydride
hydrolyzed (0.1 M NaOH, 3 mL, 55 °C, 10 min). Aqueous TFA (0.1% wt/v) was added to
adjust the total volume to 8 mL and the solution purified directly by reversed phase HPLC to
provide ImImPyPy-γ-ImOpPyPy-β-Dp-EDTA as a white powder upon lyophilization of the
appropriate fractions (5.9 mg, 42% recovery). MALDI-TOF-MS (monoisotopic), 1570.8
(1570.7 calc. for C70H92N25O18).

ImImPyPy-γ-ImHpPyPy-β-Dp-EDTA. In order to remove the methoxy protecting group,
a sample of ImImPyPy-γ-ImOpPyPy-β-Dp-EDTA (5 mg, 3.1 µmol) was treated with sodium
thiophenoxide at 100 °C for 2 h. DMF (1000 µL) and thiophenol (500 µL) were placed in a (13
x 100 mm) disposable Pyrex screw cap culture tube. A 60% dispersion of sodium hydride in
mineral oil (100 mg) was slowly added. Upon completion of the addition of the sodium hydride,
ImImPyPy-γ-ImOpPyPy-β-Dp-EDTA (5 mg) dissolved in DMF (500 µL) was added. The
solution was agitated, placed in a 100 °C heat block, and deprotected for 2 h. Upon
completion of the reaction the culture tube was cooled to 0 °C, and 7 mL of a 20% (wt/v)
solution of trifluoroacetic acid was added. The aqueous layer is separated from the resulting biphasic
solution and purified by reversed phase HPLC. ImImPyPy-γ-ImHpPyPy-β-Dp-EDTA is
recovered as a white powder upon lyophilization of the appropriate fractions (3.2 mg, 72%
recovery). UV (H2O) λmax 246, 312 (66,000); MALDI-TOF-MS (monoisotopic), 1555.9
(1556.7 calc. for C69H90N25O18).

EXAMPLE 6:

DETERMINATION OF POLYAMIDE BINDING ORIENTATION

Affinity cleavage experiments using hairpin polyamides modified with EDTA•Fe(II) at
either the C-terminus or on the γ-turn were used to determine polyamide binding orientation
and stoichiometry. The results of affinity cleavage experiments are consistent only with
recognition of 6-bp by an 8-ring hairpin complex and rule out any extended 1:1 or overlapped
complex formation. In addition, affinity cleavage experiments reveal hairpin formation,
supporting the claim that it is the Hp/Py and Py/Hp pairings which form at both match and
mismatch sites to discriminate A•T from T•A.

Affinity cleavage reactions were executed in a total volume of 40 µL. A stock solution of
polyamide or H2O was added to a solution containing labeled restriction fragment (20,000
cpm), affording final solution conditions of 25 mM Tris-Acetate, 20 mM NaCl, 100 µM/bp calf
thymus DNA, and pH 7.0. Solutions were incubated for a minimum of 4 hours at 22 °C.
Subsequently, 4 µL of freshly prepared 100 µM Fe(NH4)2(SO4)2 was added and the solution
allowed to equilibrate for 20 min. Cleavage reactions were initiated by the addition of 4 µL of
100 mM dithiothreitol, allowed to proceed for 30 min at 22 °C, then stopped by the addition of
10 µL of a solution containing 1.5 M NaOAc (pH 5.5), 0.28 mg/mL glycogen, and 14 µM base
pairs calf thymus DNA, and ethanol precipitated. The reactions were resuspended in 1x
TBE/80% formamide loading buffer, denatured by heating at 85 °C for 15 min, and placed on
ice. The reaction products were separated by electrophoresis on an 8% polyacrylamide gel
(5% cross-link, 7 M urea) in 1x TBE at 2000 V for 1.5 hours. Gels were dried and exposed to a
storage phosphor screen. Relative cleavage intensities were determined by volume integration
of individual cleavage bands using ImageQuant software.

EXAMPLE 7:

IMPROVEMENT TO POLYAMIDE SEQUENCE SPECIFICITY.

The polyamides of this invention provide improved specificity relative to existing
polyamide technology. Turner, J. T., Baird, E. E., and Dervan, P. B. describe the recognition of
seven base pair sequences in the minor groove of DNA by ten-ring pyrrole-imidazole
polyamide hairpins, J. Am. Chem. Soc. 1997, 119, 7636. For example, quantitative DNase I
footprint titrations reveal that the 10-ring hairpin ImPyPyPyPy-γ-ImPyPyPyPy-β-Dp binds a
5'-TGTAACA-3' sequence with an equilibrium dissociation constant of 0.083 nM, and 18-fold
specificity versus a single base mismatch site. A number of other sites are also bound on the
252-bp DNA fragment used for the footprint titration experiments (Figure 13). Introduction of
a Hp/Py and Py/Hp pair in the 10-ring polyamide, ImHpPyPyPy-γ-ImHpPyPyPy-β-Dp, to
recognize a T•A and A•T within the 7-bp target sequence, increases the sequence-specificity. For
example, a single base mismatch site 5'-TGGAACA-3' is discriminated by > 5000-fold (Figure
13, Table 8). In fact all 245 7-bp mismatch sites present on the restriction fragment are
discriminated > 5000-fold by the polyamide ImHpPyPyPy-γ-ImHpPyPyPy-β-Dp (Figure 13).
For cases where three A/T base pairs are present in succession it is preferred to substitute Py/Py
in place of at least one Hp/Py or Py/Hp to provide for recognition of A•T and T•A at a single
position.



TABLE 8 Equilibrium dissociation constants*

Polyamide†     5'-TGGTCA-3'            5'-TGGACA-3'              Specificity‡

Py/Py          K_d = 0.083 nM          K_d = 1.5 nM              18
Hp/Py          K_d = 0.2 nM            K_d > 1000 nM             >5000

[Binding-model diagrams not reproduced; the sites drawn are 5'-TGTAACA-3' (first column)
and 5'-TGGTACA-3' (second column).]

*The reported dissociation constants are the average values obtained from three
DNase I footprint titration experiments. The standard deviation for each data set is
less than 15% of the reported number. Assays were carried out in the presence of 10
mM Tris•HCl, 10 mM KCl, 10 mM MgCl2, and 5 mM CaCl2 at pH 7.0 and 22 °C.
†Ring pairing opposite T•A and A•T in the fourth position.
‡Calculated as K_d(5'-TGGTACA-3')/K_d(5'-TGTAACA-3').
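
The specificity figures quoted in this example are ratios of dissociation constants, as defined in the table footnote. The short sketch below reproduces that arithmetic from the K_d values given above; the function name `specificity` is a hypothetical label, and for the Hp-containing hairpin the mismatch K_d is only a lower bound, so the ratio is a lower bound as well.

```python
def specificity(kd_mismatch, kd_match):
    """Fold preference for the match site: K_d(mismatch) / K_d(match)."""
    return kd_mismatch / kd_match

# Ten-ring Py/Py hairpin, K_d values in nM as reported above.
print(round(specificity(1.5, 0.083)))

# Ten-ring Hp/Py hairpin: K_d(mismatch) > 1000 nM, so this is a lower bound.
print(specificity(1000.0, 0.2))
```

The first ratio rounds to the 18-fold specificity cited in the text, and the second corresponds to the "> 5000-fold" discrimination.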



EXAMPLE 8:
USE OF PAIRING CODE 

There are 256 possible four base pair combinations of A, T, G, and C. Of these, there are
a possible 240 four base pair sequences which contain at least 1 A•T or T•A base pair and
therefore can advantageously use an Hp/Py or Py/Hp carboxamide binding. Polyamides
binding to any of these sequences can be designed in accordance with the code of TABLE 2.
Table 9 lists the sixteen eight-ring hairpin polyamides (1-16) which recognize the sixteen
5'-WGTNNW-3' sequences (W = A or T, N = A, G, C, or T). Table 10 lists the sixteen eight-ring
hairpin polyamides (17-32) which recognize the sixteen 5'-WGANNW-3' sequences (17-32).
Table 11 lists the twelve eight-ring hairpin polyamides (33-44) which recognize twelve
5'-WGGNNW-3' sequences which contain at least one A,T base pair. Table 11 lists the four
eight-ring hairpin polyamides (G1-G4) which target the four 5'-WGGNNW-3' sequences (G1-G4)
which contain exclusively G•C base pairs. Table 12 lists the twelve eight-ring hairpin
polyamides (45-56) which recognize twelve 5'-WGCNNW-3' sequences which contain at least
one A,T base pair. Table 12 lists the four eight-ring hairpin polyamides (G5-G8) which target
the four 5'-WGCNNW-3' sequences (G5-G8) which contain exclusively G•C base pairs. Table
13 lists the sixteen eight-ring hairpin polyamides (57-72) which recognize the sixteen
5'-WTTNNW-3' sequences (57-72). Table 14 lists the sixteen eight-ring hairpin polyamides
(73-88) which recognize the sixteen 5'-WTANNW-3' sequences (73-88). Table 15 lists the sixteen
eight-ring hairpin polyamides (89-104) which recognize the sixteen 5'-WTGNNW-3' sequences
(89-104). Table 16 lists the sixteen eight-ring hairpin polyamides (105-120) which recognize
the sixteen 5'-WTCNNW-3' sequences (105-120). Table 17 lists the sixteen eight-ring hairpin
polyamides (121-136) which recognize the sixteen 5'-WATNNW-3' sequences (121-136).
Table 18 lists the sixteen eight-ring hairpin polyamides (137-152) which recognize the sixteen
5'-WAANNW-3' sequences (137-152). Table 19 lists the sixteen eight-ring hairpin polyamides
(153-168) which recognize the sixteen 5'-WAGNNW-3' sequences (153-168). Table 20 lists
the sixteen eight-ring hairpin polyamides (169-184) which recognize the sixteen
5'-WACNNW-3' sequences (169-184). Table 21 lists the sixteen eight-ring hairpin polyamides (185-200)
which recognize the sixteen 5'-WCTNNW-3' sequences (185-200). Table 22 lists the sixteen
eight-ring hairpin polyamides (201-216) which recognize the sixteen 5'-WCANNW-3'
sequences (201-216). Table 23 lists the twelve eight-ring hairpin polyamides (217-228) which
recognize the twelve 5'-WCGNNW-3' sequences which contain at least one A,T base pair.
Table 23 lists the four eight-ring hairpin polyamides (G9-G12) which target the four
5'-WCGNNW-3' sequences (G9-G12) which contain exclusively C•G base pairs. Table 24 lists
the twelve eight-ring hairpin polyamides (229-240) which recognize the twelve
5'-WCCNNW-3' sequences which contain at least one A,T base pair. Table 24 lists the four eight-ring hairpin
polyamides (G13-G16) which target the four 5'-WCCNNW-3' sequences (G13-G16) which
contain exclusively C•G base pairs.
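
The counts above and the pattern of the listed polyamides can be illustrated with a short sketch. The base-to-ring-pair mapping used here is inferred from the entries of the tables that follow (Im/Py opposite G, Py/Im opposite C, Hp/Py opposite T, Py/Hp opposite A, read along the 5'-3' strand); it is offered as an illustration of that pattern, not as a restatement of TABLE 2, and the names `PAIR_FOR_BASE` and `design_hairpin` are hypothetical.

```python
from itertools import product

# Ring pair (first-half ring, second-half ring) placed opposite each base of the
# 5'->3' target strand, following the pattern of the table entries.
PAIR_FOR_BASE = {
    "G": ("Im", "Py"),
    "C": ("Py", "Im"),
    "T": ("Hp", "Py"),
    "A": ("Py", "Hp"),
}

def design_hairpin(core):
    """Return the eight-ring hairpin name for the four central bases (5'->3')
    of a 5'-WNNNNW-3' site."""
    pairs = [PAIR_FOR_BASE[base] for base in core.upper()]
    first_half = "".join(p[0] for p in pairs)
    # After the gamma turn the second half runs back along the first half,
    # so its rings are listed in reverse order of the bases they face.
    second_half = "".join(p[1] for p in reversed(pairs))
    return f"{first_half}-γ-{second_half}"

# Enumerate all four-base cores and those containing at least one A or T.
cores = ["".join(c) for c in product("ATGC", repeat=4)]
with_at = [c for c in cores if set(c) & {"A", "T"}]
print(len(cores), len(with_at))

print(design_hairpin("GTTA"))  # core of 5'-WGTTAW-3'
```

The enumeration gives 256 cores, of which 240 contain at least one A or T; the remaining 16 are the all-G/C cores listed as G1-G16. The example call returns the name given for polyamide 2 in Table 9.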
TABLE 9: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WGTNNW-3'

      DNA sequence              aromatic amino acid sequence

  1)  5'-W G T T T W-3'      1) ImHpHpHp-γ-PyPyPyPy
  2)  5'-W G T T A W-3'      2) ImHpHpPy-γ-HpPyPyPy
  3)  5'-W G T T G W-3'      3) ImHpHpIm-γ-PyPyPyPy
  4)  5'-W G T T C W-3'      4) ImHpHpPy-γ-ImPyPyPy
  5)  5'-W G T A T W-3'      5) ImHpPyHp-γ-PyHpPyPy
  6)  5'-W G T A A W-3'      6) ImHpPyPy-γ-HpHpPyPy
  7)  5'-W G T A G W-3'      7) ImHpPyIm-γ-PyHpPyPy
  8)  5'-W G T A C W-3'      8) ImHpPyPy-γ-ImHpPyPy
  9)  5'-W G T G T W-3'      9) ImHpImHp-γ-PyPyPyPy
 10)  5'-W G T G A W-3'     10) ImHpImPy-γ-HpPyPyPy
 11)  5'-W G T G G W-3'     11) ImHpImIm-γ-PyPyPyPy
 12)  5'-W G T G C W-3'     12) ImHpImPy-γ-ImPyPyPy
 13)  5'-W G T C T W-3'     13) ImHpPyHp-γ-PyImPyPy
 14)  5'-W G T C A W-3'     14) ImHpPyPy-γ-HpImPyPy
 15)  5'-W G T C G W-3'     15) ImHpPyIm-γ-PyImPyPy
 16)  5'-W G T C C W-3'     16) ImHpPyPy-γ-ImImPyPy

TABLE 10: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WGANNW-3'

      DNA sequence              aromatic amino acid sequence

 17)  5'-W G A T T W-3'     17) ImPyHpHp-γ-PyPyHpPy
 18)  5'-W G A T A W-3'     18) ImPyHpPy-γ-HpPyHpPy
 19)  5'-W G A T G W-3'     19) ImPyHpIm-γ-PyPyHpPy
 20)  5'-W G A T C W-3'     20) ImPyHpPy-γ-ImPyHpPy
 21)  5'-W G A A T W-3'     21) ImPyPyHp-γ-PyHpHpPy
 22)  5'-W G A A A W-3'     22) ImPyPyPy-γ-HpHpHpPy
 23)  5'-W G A A G W-3'     23) ImPyPyIm-γ-PyHpHpPy
 24)  5'-W G A A C W-3'     24) ImPyPyPy-γ-ImHpHpPy
 25)  5'-W G A G T W-3'     25) ImPyImHp-γ-PyPyHpPy
 26)  5'-W G A G A W-3'     26) ImPyImPy-γ-HpPyHpPy
 27)  5'-W G A G G W-3'     27) ImPyImIm-γ-PyPyHpPy
 28)  5'-W G A G C W-3'     28) ImPyImPy-γ-ImPyHpPy
 29)  5'-W G A C T W-3'     29) ImPyPyHp-γ-PyImHpPy
 30)  5'-W G A C A W-3'     30) ImPyPyPy-γ-HpImHpPy
 31)  5'-W G A C G W-3'     31) ImPyPyIm-γ-PyImHpPy
 32)  5'-W G A C C W-3'     32) ImPyPyPy-γ-ImImHpPy

TABLE 11: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WGGNNW-3'

      DNA sequence              aromatic amino acid sequence

 33)  5'-W G G T T W-3'     33) ImImHpHp-γ-PyPyPyPy
 34)  5'-W G G T A W-3'     34) ImImHpPy-γ-HpPyPyPy
 35)  5'-W G G T G W-3'     35) ImImHpIm-γ-PyPyPyPy
 36)  5'-W G G T C W-3'     36) ImImHpPy-γ-ImPyPyPy
 37)  5'-W G G A T W-3'     37) ImImPyHp-γ-PyHpPyPy
 38)  5'-W G G A A W-3'     38) ImImPyPy-γ-HpHpPyPy
 39)  5'-W G G A G W-3'     39) ImImPyIm-γ-PyHpPyPy
 40)  5'-W G G A C W-3'     40) ImImPyPy-γ-ImHpPyPy
 41)  5'-W G G G T W-3'     41) ImImImHp-γ-PyPyPyPy
 42)  5'-W G G G A W-3'     42) ImImImPy-γ-HpPyPyPy
 43)  5'-W G G C T W-3'     43) ImImPyHp-γ-PyImPyPy
 44)  5'-W G G C A W-3'     44) ImImPyPy-γ-HpImPyPy
 G1)  5'-W G G G G W-3'     G1) ImImImIm-γ-PyPyPyPy
 G2)  5'-W G G G C W-3'     G2) ImImImPy-γ-ImPyPyPy
 G3)  5'-W G G C G W-3'     G3) ImImPyIm-γ-PyImPyPy
 G4)  5'-W G G C C W-3'     G4) ImImPyPy-γ-ImImPyPy

TABLE 12: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WGCNNW-3'

      DNA sequence              aromatic amino acid sequence

 45)  5'-W G C T T W-3'     45) ImPyHpHp-γ-PyPyImPy
 46)  5'-W G C T A W-3'     46) ImPyHpPy-γ-HpPyImPy
 47)  5'-W G C T G W-3'     47) ImPyHpIm-γ-PyPyImPy
 48)  5'-W G C T C W-3'     48) ImPyHpPy-γ-ImPyImPy
 49)  5'-W G C A T W-3'     49) ImPyPyHp-γ-PyHpImPy
 50)  5'-W G C A A W-3'     50) ImPyPyPy-γ-HpHpImPy
 51)  5'-W G C A G W-3'     51) ImPyPyIm-γ-PyHpImPy
 52)  5'-W G C A C W-3'     52) ImPyPyPy-γ-ImHpImPy
 53)  5'-W G C G T W-3'     53) ImPyImHp-γ-PyPyImPy
 54)  5'-W G C G A W-3'     54) ImPyImPy-γ-HpPyImPy
 55)  5'-W G C C T W-3'     55) ImPyPyHp-γ-PyImImPy
 56)  5'-W G C C A W-3'     56) ImPyPyPy-γ-HpImImPy
 G5)  5'-W G C G G W-3'     G5) ImPyImIm-γ-PyPyImPy
 G6)  5'-W G C G C W-3'     G6) ImPyImPy-γ-ImPyImPy
 G7)  5'-W G C C G W-3'     G7) ImPyPyIm-γ-PyImImPy
 G8)  5'-W G C C C W-3'     G8) ImPyPyPy-γ-ImImImPy

TABLE 13: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WTTNNW-3'

DNA sequence              aromatic amino acid sequence

57) 5'-W T T T T W-3'     57) HpHpHpHp-γ-PyPyPyPy
58) 5'-W T T T A W-3'     58) HpHpHpPy-γ-HpPyPyPy
59) 5'-W T T T G W-3'     59) HpHpHpIm-γ-PyPyPyPy
60) 5'-W T T T C W-3'     60) HpHpHpPy-γ-ImPyPyPy
61) 5'-W T T A T W-3'     61) HpHpPyHp-γ-PyHpPyPy
62) 5'-W T T A A W-3'     62) HpHpPyPy-γ-HpHpPyPy
63) 5'-W T T A G W-3'     63) HpHpPyIm-γ-PyHpPyPy
64) 5'-W T T A C W-3'     64) HpHpPyPy-γ-ImHpPyPy
65) 5'-W T T G T W-3'     65) HpHpImHp-γ-PyPyPyPy
66) 5'-W T T G A W-3'     66) HpHpImPy-γ-HpPyPyPy
67) 5'-W T T G G W-3'     67) HpHpImIm-γ-PyPyPyPy
68) 5'-W T T G C W-3'     68) HpHpImPy-γ-ImPyPyPy
69) 5'-W T T C T W-3'     69) HpHpPyHp-γ-PyImPyPy
70) 5'-W T T C A W-3'     70) HpHpPyPy-γ-HpImPyPy
71) 5'-W T T C G W-3'     71) HpHpPyIm-γ-PyImPyPy
72) 5'-W T T C C W-3'     72) HpHpPyPy-γ-ImImPyPy
TABLE 14: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WTANNW-3'

DNA sequence              aromatic amino acid sequence

73) 5'-W T A T T W-3'     73) HpPyHpHp-γ-PyPyHpPy
74) 5'-W T A T A W-3'     74) HpPyHpPy-γ-HpPyHpPy
75) 5'-W T A T G W-3'     75) HpPyHpIm-γ-PyPyHpPy
76) 5'-W T A T C W-3'     76) HpPyHpPy-γ-ImPyHpPy
77) 5'-W T A A T W-3'     77) HpPyPyHp-γ-PyHpHpPy
78) 5'-W T A A A W-3'     78) HpPyPyPy-γ-HpHpHpPy
79) 5'-W T A A G W-3'     79) HpPyPyIm-γ-PyHpHpPy
80) 5'-W T A A C W-3'     80) HpPyPyPy-γ-ImHpHpPy
81) 5'-W T A G T W-3'     81) HpPyImHp-γ-PyPyHpPy
82) 5'-W T A G A W-3'     82) HpPyImPy-γ-HpPyHpPy
83) 5'-W T A G G W-3'     83) HpPyImIm-γ-PyPyHpPy
84) 5'-W T A G C W-3'     84) HpPyImPy-γ-ImPyHpPy
85) 5'-W T A C T W-3'     85) HpPyPyHp-γ-PyImHpPy
86) 5'-W T A C A W-3'     86) HpPyPyPy-γ-HpImHpPy
87) 5'-W T A C G W-3'     87) HpPyPyIm-γ-PyImHpPy
88) 5'-W T A C C W-3'     88) HpPyPyPy-γ-ImImHpPy
TABLE 15: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WTGNNW-3'

DNA sequence               aromatic amino acid sequence

89)  5'-W T G T T W-3'     89)  HpImHpHp-γ-PyPyPyPy
90)  5'-W T G T A W-3'     90)  HpImHpPy-γ-HpPyPyPy
91)  5'-W T G T G W-3'     91)  HpImHpIm-γ-PyPyPyPy
92)  5'-W T G T C W-3'     92)  HpImHpPy-γ-ImPyPyPy
93)  5'-W T G A T W-3'     93)  HpImPyHp-γ-PyHpPyPy
94)  5'-W T G A A W-3'     94)  HpImPyPy-γ-HpHpPyPy
95)  5'-W T G A G W-3'     95)  HpImPyIm-γ-PyHpPyPy
96)  5'-W T G A C W-3'     96)  HpImPyPy-γ-ImHpPyPy
97)  5'-W T G G T W-3'     97)  HpImImHp-γ-PyPyPyPy
98)  5'-W T G G A W-3'     98)  HpImImPy-γ-HpPyPyPy
99)  5'-W T G C T W-3'     99)  HpImPyHp-γ-PyImPyPy
100) 5'-W T G C A W-3'     100) HpImPyPy-γ-HpImPyPy
101) 5'-W T G G G W-3'     101) HpImImIm-γ-PyPyPyPy
102) 5'-W T G G C W-3'     102) HpImImPy-γ-ImPyPyPy
103) 5'-W T G C G W-3'     103) HpImPyIm-γ-PyImPyPy
104) 5'-W T G C C W-3'     104) HpImPyPy-γ-ImImPyPy
TABLE 16: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WTCNNW-3'

DNA sequence               aromatic amino acid sequence

105) 5'-W T C T T W-3'     105) HpPyHpHp-γ-PyPyImPy
106) 5'-W T C T A W-3'     106) HpPyHpPy-γ-HpPyImPy
107) 5'-W T C T G W-3'     107) HpPyHpIm-γ-PyPyImPy
108) 5'-W T C T C W-3'     108) HpPyHpPy-γ-ImPyImPy
109) 5'-W T C A T W-3'     109) HpPyPyHp-γ-PyHpImPy
110) 5'-W T C A A W-3'     110) HpPyPyPy-γ-HpHpImPy
111) 5'-W T C A G W-3'     111) HpPyPyIm-γ-PyHpImPy
112) 5'-W T C A C W-3'     112) HpPyPyPy-γ-ImHpImPy
113) 5'-W T C G T W-3'     113) HpPyImHp-γ-PyPyImPy
114) 5'-W T C G A W-3'     114) HpPyImPy-γ-HpPyImPy
115) 5'-W T C C T W-3'     115) HpPyPyHp-γ-PyImImPy
116) 5'-W T C C A W-3'     116) HpPyPyPy-γ-HpImImPy
117) 5'-W T C G G W-3'     117) HpPyImIm-γ-PyPyImPy
118) 5'-W T C G C W-3'     118) HpPyImPy-γ-ImPyImPy
119) 5'-W T C C G W-3'     119) HpPyPyIm-γ-PyImImPy
120) 5'-W T C C C W-3'     120) HpPyPyPy-γ-ImImImPy
TABLE 17: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WATNNW-3'

DNA sequence               aromatic amino acid sequence

121) 5'-W A T T T W-3'     121) PyHpHpHp-γ-PyPyPyHp
122) 5'-W A T T A W-3'     122) PyHpHpPy-γ-HpPyPyHp
123) 5'-W A T T G W-3'     123) PyHpHpIm-γ-PyPyPyHp
124) 5'-W A T T C W-3'     124) PyHpHpPy-γ-ImPyPyHp
125) 5'-W A T A T W-3'     125) PyHpPyHp-γ-PyHpPyHp
126) 5'-W A T A A W-3'     126) PyHpPyPy-γ-HpHpPyHp
127) 5'-W A T A G W-3'     127) PyHpPyIm-γ-PyHpPyHp
128) 5'-W A T A C W-3'     128) PyHpPyPy-γ-ImHpPyHp
129) 5'-W A T G T W-3'     129) PyHpImHp-γ-PyPyPyHp
130) 5'-W A T G A W-3'     130) PyHpImPy-γ-HpPyPyHp
131) 5'-W A T G G W-3'     131) PyHpImIm-γ-PyPyPyHp
132) 5'-W A T G C W-3'     132) PyHpImPy-γ-ImPyPyHp
133) 5'-W A T C T W-3'     133) PyHpPyHp-γ-PyImPyHp
134) 5'-W A T C A W-3'     134) PyHpPyPy-γ-HpImPyHp
135) 5'-W A T C G W-3'     135) PyHpPyIm-γ-PyImPyHp
136) 5'-W A T C C W-3'     136) PyHpPyPy-γ-ImImPyHp
TABLE 18: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WAANNW-3'

DNA sequence               aromatic amino acid sequence

137) 5'-W A A T T W-3'     137) PyPyHpHp-γ-PyPyHpHp
138) 5'-W A A T A W-3'     138) PyPyHpPy-γ-HpPyHpHp
139) 5'-W A A T G W-3'     139) PyPyHpIm-γ-PyPyHpHp
140) 5'-W A A T C W-3'     140) PyPyHpPy-γ-ImPyHpHp
141) 5'-W A A A T W-3'     141) PyPyPyHp-γ-PyHpHpHp
142) 5'-W A A A A W-3'     142) PyPyPyPy-γ-HpHpHpHp
143) 5'-W A A A G W-3'     143) PyPyPyIm-γ-PyHpHpHp
144) 5'-W A A A C W-3'     144) PyPyPyPy-γ-ImHpHpHp
145) 5'-W A A G T W-3'     145) PyPyImHp-γ-PyPyHpHp
146) 5'-W A A G A W-3'     146) PyPyImPy-γ-HpPyHpHp
147) 5'-W A A G G W-3'     147) PyPyImIm-γ-PyPyHpHp
148) 5'-W A A G C W-3'     148) PyPyImPy-γ-ImPyHpHp
149) 5'-W A A C T W-3'     149) PyPyPyHp-γ-PyImHpHp
150) 5'-W A A C A W-3'     150) PyPyPyPy-γ-HpImHpHp
151) 5'-W A A C G W-3'     151) PyPyPyIm-γ-PyImHpHp
152) 5'-W A A C C W-3'     152) PyPyPyPy-γ-ImImHpHp
TABLE 19: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WAGNNW-3'

DNA sequence               aromatic amino acid sequence

153) 5'-W A G T T W-3'     153) PyImHpHp-γ-PyPyPyHp
154) 5'-W A G T A W-3'     154) PyImHpPy-γ-HpPyPyHp
155) 5'-W A G T G W-3'     155) PyImHpIm-γ-PyPyPyHp
156) 5'-W A G T C W-3'     156) PyImHpPy-γ-ImPyPyHp
157) 5'-W A G A T W-3'     157) PyImPyHp-γ-PyHpPyHp
158) 5'-W A G A A W-3'     158) PyImPyPy-γ-HpHpPyHp
159) 5'-W A G A G W-3'     159) PyImPyIm-γ-PyHpPyHp
160) 5'-W A G A C W-3'     160) PyImPyPy-γ-ImHpPyHp
161) 5'-W A G G T W-3'     161) PyImImHp-γ-PyPyPyHp
162) 5'-W A G G A W-3'     162) PyImImPy-γ-HpPyPyHp
163) 5'-W A G C T W-3'     163) PyImPyHp-γ-PyImPyHp
164) 5'-W A G C A W-3'     164) PyImPyPy-γ-HpImPyHp
165) 5'-W A G G G W-3'     165) PyImImIm-γ-PyPyPyHp
166) 5'-W A G G C W-3'     166) PyImImPy-γ-ImPyPyHp
167) 5'-W A G C G W-3'     167) PyImPyIm-γ-PyImPyHp
168) 5'-W A G C C W-3'     168) PyImPyPy-γ-ImImPyHp
TABLE 20: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WACNNW-3'

DNA sequence               aromatic amino acid sequence

169) 5'-W A C T T W-3'     169) PyPyHpHp-γ-PyPyImHp
170) 5'-W A C T A W-3'     170) PyPyHpPy-γ-HpPyImHp
171) 5'-W A C T G W-3'     171) PyPyHpIm-γ-PyPyImHp
172) 5'-W A C T C W-3'     172) PyPyHpPy-γ-ImPyImHp
173) 5'-W A C A T W-3'     173) PyPyPyHp-γ-PyHpImHp
174) 5'-W A C A A W-3'     174) PyPyPyPy-γ-HpHpImHp
175) 5'-W A C A G W-3'     175) PyPyPyIm-γ-PyHpImHp
176) 5'-W A C A C W-3'     176) PyPyPyPy-γ-ImHpImHp
177) 5'-W A C G T W-3'     177) PyPyImHp-γ-PyPyImHp
178) 5'-W A C G A W-3'     178) PyPyImPy-γ-HpPyImHp
179) 5'-W A C C T W-3'     179) PyPyPyHp-γ-PyImImHp
180) 5'-W A C C A W-3'     180) PyPyPyPy-γ-HpImImHp
181) 5'-W A C G G W-3'     181) PyPyImIm-γ-PyPyImHp
182) 5'-W A C G C W-3'     182) PyPyImPy-γ-ImPyImHp
183) 5'-W A C C G W-3'     183) PyPyPyIm-γ-PyImImHp
184) 5'-W A C C C W-3'     184) PyPyPyPy-γ-ImImImHp
TABLE 21: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WCTNNW-3'

DNA sequence               aromatic amino acid sequence

185) 5'-W C T T T W-3'     185) PyHpHpHp-γ-PyPyPyIm
186) 5'-W C T T A W-3'     186) PyHpHpPy-γ-HpPyPyIm
187) 5'-W C T T G W-3'     187) PyHpHpIm-γ-PyPyPyIm
188) 5'-W C T T C W-3'     188) PyHpHpPy-γ-ImPyPyIm
189) 5'-W C T A T W-3'     189) PyHpPyHp-γ-PyHpPyIm
190) 5'-W C T A A W-3'     190) PyHpPyPy-γ-HpHpPyIm
191) 5'-W C T A G W-3'     191) PyHpPyIm-γ-PyHpPyIm
192) 5'-W C T A C W-3'     192) PyHpPyPy-γ-ImHpPyIm
193) 5'-W C T G T W-3'     193) PyHpImHp-γ-PyPyPyIm
194) 5'-W C T G A W-3'     194) PyHpImPy-γ-HpPyPyIm
195) 5'-W C T G G W-3'     195) PyHpImIm-γ-PyPyPyIm
196) 5'-W C T G C W-3'     196) PyHpImPy-γ-ImPyPyIm
197) 5'-W C T C T W-3'     197) PyHpPyHp-γ-PyImPyIm
198) 5'-W C T C A W-3'     198) PyHpPyPy-γ-HpImPyIm
199) 5'-W C T C G W-3'     199) PyHpPyIm-γ-PyImPyIm
200) 5'-W C T C C W-3'     200) PyHpPyPy-γ-ImImPyIm
TABLE 22: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WCANNW-3'

DNA sequence               aromatic amino acid sequence

201) 5'-W C A T T W-3'     201) PyPyHpHp-γ-PyPyHpIm
202) 5'-W C A T A W-3'     202) PyPyHpPy-γ-HpPyHpIm
203) 5'-W C A T G W-3'     203) PyPyHpIm-γ-PyPyHpIm
204) 5'-W C A T C W-3'     204) PyPyHpPy-γ-ImPyHpIm
205) 5'-W C A A T W-3'     205) PyPyPyHp-γ-PyHpHpIm
206) 5'-W C A A A W-3'     206) PyPyPyPy-γ-HpHpHpIm
207) 5'-W C A A G W-3'     207) PyPyPyIm-γ-PyHpHpIm
208) 5'-W C A A C W-3'     208) PyPyPyPy-γ-ImHpHpIm
209) 5'-W C A G T W-3'     209) PyPyImHp-γ-PyPyHpIm
210) 5'-W C A G A W-3'     210) PyPyImPy-γ-HpPyHpIm
211) 5'-W C A G G W-3'     211) PyPyImIm-γ-PyPyHpIm
212) 5'-W C A G C W-3'     212) PyPyImPy-γ-ImPyHpIm
213) 5'-W C A C T W-3'     213) PyPyPyHp-γ-PyImHpIm
214) 5'-W C A C A W-3'     214) PyPyPyPy-γ-HpImHpIm
215) 5'-W C A C G W-3'     215) PyPyPyIm-γ-PyImHpIm
216) 5'-W C A C C W-3'     216) PyPyPyPy-γ-ImImHpIm
TABLE 23: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WCGNNW-3'

DNA sequence               aromatic amino acid sequence

217) 5'-W C G T T W-3'     217) PyImHpHp-γ-PyPyPyIm
218) 5'-W C G T A W-3'     218) PyImHpPy-γ-HpPyPyIm
219) 5'-W C G T G W-3'     219) PyImHpIm-γ-PyPyPyIm
220) 5'-W C G T C W-3'     220) PyImHpPy-γ-ImPyPyIm
221) 5'-W C G A T W-3'     221) PyImPyHp-γ-PyHpPyIm
222) 5'-W C G A A W-3'     222) PyImPyPy-γ-HpHpPyIm
223) 5'-W C G A G W-3'     223) PyImPyIm-γ-PyHpPyIm
224) 5'-W C G A C W-3'     224) PyImPyPy-γ-ImHpPyIm
225) 5'-W C G G T W-3'     225) PyImImHp-γ-PyPyPyIm
226) 5'-W C G G A W-3'     226) PyImImPy-γ-HpPyPyIm
227) 5'-W C G C T W-3'     227) PyImPyHp-γ-PyImPyIm
228) 5'-W C G C A W-3'     228) PyImPyPy-γ-HpImPyIm
G9)  5'-W C G G G W-3'     G9)  PyImImIm-γ-PyPyPyIm
G10) 5'-W C G G C W-3'     G10) PyImImPy-γ-ImPyPyIm
G11) 5'-W C G C G W-3'     G11) PyImPyIm-γ-PyImPyIm
G12) 5'-W C G C C W-3'     G12) PyImPyPy-γ-ImImPyIm






WO98/37066 



PCT/US98/01006 



TABLE 24: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WCCNNW-3'

DNA sequence               aromatic amino acid sequence

229) 5'-W C C T T W-3'     229) PyPyHpHp-γ-PyPyImIm
230) 5'-W C C T A W-3'     230) PyPyHpPy-γ-HpPyImIm
231) 5'-W C C T G W-3'     231) PyPyHpIm-γ-PyPyImIm
232) 5'-W C C T C W-3'     232) PyPyHpPy-γ-ImPyImIm
233) 5'-W C C A T W-3'     233) PyPyPyHp-γ-PyHpImIm
234) 5'-W C C A A W-3'     234) PyPyPyPy-γ-HpHpImIm
235) 5'-W C C A G W-3'     235) PyPyPyIm-γ-PyHpImIm
236) 5'-W C C A C W-3'     236) PyPyPyPy-γ-ImHpImIm
237) 5'-W C C G T W-3'     237) PyPyImHp-γ-PyPyImIm
238) 5'-W C C G A W-3'     238) PyPyImPy-γ-HpPyImIm
239) 5'-W C C C T W-3'     239) PyPyPyHp-γ-PyImImIm
240) 5'-W C C C A W-3'     240) PyPyPyPy-γ-HpImImIm
G13) 5'-W C C G G W-3'     G13) PyPyImIm-γ-PyPyImIm
G14) 5'-W C C G C W-3'     G14) PyPyImPy-γ-ImPyImIm
G15) 5'-W C C C G W-3'     G15) PyPyPyIm-γ-PyImImIm
G16) 5'-W C C C C W-3'     G16) PyPyPyPy-γ-ImImImIm
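
The tables can also be read in the reverse direction, from a polyamide name to the site it is designed to recognize. The following is a minimal sketch under the stated pairing rules, not part of the original disclosure. The names `BASE_FOR_PAIR` and `target_site` are hypothetical, and the parser accepts a plain "y" for the γ turn only as a convenience for transcribed text.

```python
import re

# Reverse lookup of the pairing rules used in the tables:
# (first-strand ring, second-strand ring) -> base on the 5'->3' target strand.
BASE_FOR_PAIR = {
    ("Im", "Py"): "G",
    ("Py", "Im"): "C",
    ("Hp", "Py"): "T",
    ("Py", "Hp"): "A",
}


def target_site(hairpin: str) -> str:
    """Predict the 5'-WNNNNW-3' site for an 8-ring hairpin such as
    'ImImHpPy-γ-HpPyPyPy'."""
    parts = re.split(r"-(?:γ|y)-", hairpin.replace(" ", ""))
    if len(parts) != 2:
        raise ValueError("expected exactly one γ turn in the name")
    left, right = parts
    first = re.findall(r"Im|Py|Hp", left)
    second = re.findall(r"Im|Py|Hp", right)
    if "".join(first) != left or "".join(second) != right:
        raise ValueError("unrecognized ring symbol in the name")
    if len(first) != 4 or len(second) != 4:
        raise ValueError("expected four rings on each side of the turn")
    core = []
    # X1 pairs with X8, X2 with X7, X3 with X6 and X4 with X5.
    for pair in zip(first, reversed(second)):
        if pair not in BASE_FOR_PAIR:
            raise ValueError(f"pair {pair[0]}/{pair[1]} is not in the lookup")
        core.append(BASE_FOR_PAIR[pair])
    return "5'-W" + "".join(core) + "W-3'"


if __name__ == "__main__":
    print(target_site("PyPyPyPy-γ-ImImImIm"))
```

Pairs outside the four listed rules (for example Py/Py) are rejected rather than guessed, since this sketch covers only the pairings used in the tables.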



EXAMPLE 9: 

Aliphatic/Aromatic amino acid pairing for recognition of the DNA minor groove. 

Selective placement of an aliphatic β-alanine (β) residue paired side-by-side with either
a pyrrole (Py) or imidazole (Im) aromatic amino acid is found to compensate for sequence
composition effects for recognition of the minor groove of DNA by hairpin pyrrole-imidazole
polyamides. A series of polyamides was prepared which contain pyrrole and imidazole
aromatic amino acids, as well as γ-aminobutyric acid (γ) "turn" and β-alanine "spring" aliphatic
amino acid residues. The binding affinities and specificities of these polyamides are regulated
by the placement of paired β/β, Py/β and Im/β residues. Quantitative footprint titrations
demonstrate that replacing two Py/Py pairings in a 12-ring hairpin (6-γ-6) with two Py/β
pairings affords 10-fold enhanced affinity and similar sequence specificity for an 8-bp target
sequence.

Table 25 Equilibrium association constants (M^-1) for polyamides.a-c

[The polyamide column of this table consisted of structure drawings that are not recoverable; rows are listed in their original order.]

Polyamide     5'-TGTTAACA-3'           5'-TGTGAACA-3'           Specificity d

[drawing]     2.5 × 10^9               3.9 × 10^8               6
[drawing]     1.3 × 10^9               2.0 × 10^8               7
[drawing]     1.7 × 10^10              2.7 × 10^9               6
[drawing]     1.2 × 10^11              2.2 × 10^9               55
[drawing]     6.6 × 10^9               2.5 × 10^8               26
[drawing]     4.5 × 10^10              7.7 × 10^9               6
[drawing]     2.7 × 10^10              5.7 × 10^9               5
[drawing]     ≤1 × 10^[illegible]      ≤1 × 10^[illegible]      1

a Values reported are the mean values obtained from three DNase I
footprint titration experiments. b The assays were carried out at 22 °C at
pH 7.0 in the presence of 10 mM Tris-HCl, 10 mM KCl, 10 mM MgCl2,
and 5 mM CaCl2. c Match site association constants and specificities
higher than the parent hairpin are shown in boldtype. d Specificity is
calculated as Ka(match) / Ka(mismatch).

The 6-γ-6 hairpin ImPyImPyPyPy-γ-ImPyPyPyPyPy-β-Dp, which contains six
consecutive amino acid pairings, is unable to discriminate a single-base-pair mismatch site 5'-
TGTTAACA-3' from a 5'-TGTGAACA-3' match site. The hairpin polyamide Im-β-
ImPyPyPy-γ-ImPyPyPy-β-Py-β-Dp binds to the 8-bp match sequence 5'-TGTGAACA-3' with
an equilibrium association constant of Ka = 2.4 × 10^10 M^-1 and >48-fold specificity versus the
5'-TGTTAACA-3' single-base-pair mismatch site.
Table 26 Equilibrium association constants (M^-1) for polyamides.a-c

[The polyamide column of this table consisted of structure drawings that are not recoverable; rows are listed in their original order.]

Polyamide     5'-TGTTAACA-3'     5'-TGTGAACA-3'     Specificity d

[drawing]     2.5 × 10^9         3.9 × 10^8         6
[drawing]     6.6 × 10^9         2.5 × 10^8         26
[drawing]     5 × 10^9           5 × 10^9           1
[drawing]     ≤5 × 10^8          2.4 × 10^10        ≥48

a Values reported for 1, 5, and 10 are the mean values obtained from
three DNase I footprint titration experiments. b The assays were carried
out at 22 °C at pH 7.0 in the presence of 10 mM Tris-HCl, 10 mM KCl,
10 mM MgCl2, and 5 mM CaCl2. c Match site association constants
and specificities higher than parent hairpins are shown in
boldtype. d Specificity is calculated as Ka(match) / Ka(mismatch).
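
The specificity figure defined in footnote d is a simple ratio of association constants. As a minimal worked sketch, not part of the original disclosure, the hypothetical helper `specificity` below computes it. The inputs are illustrative values of the magnitude discussed in this example. When the mismatch constant is only an upper bound, the computed ratio is a lower bound on the specificity.

```python
def specificity(ka_match: float, ka_mismatch: float) -> float:
    """Specificity = Ka(match) / Ka(mismatch), both in M^-1."""
    if ka_match <= 0 or ka_mismatch <= 0:
        raise ValueError("association constants must be positive")
    return ka_match / ka_mismatch


if __name__ == "__main__":
    # Ka(match) = 6.6e9 M^-1 and Ka(mismatch) = 2.5e8 M^-1 give about 26.
    print(round(specificity(6.6e9, 2.5e8)))
    # Ka(match) = 2.4e10 M^-1 with Ka(mismatch) bounded above by 5e8 M^-1
    # gives a lower bound of 48 on the specificity.
    print(specificity(2.4e10, 5e8))
```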

Modeling indicates that the β-alanine residue relaxes ligand curvature, providing for
optimal hydrogen bond formation between the floor of the minor groove and both Im residues
within the Im-β-Im polyamide subunit. This observation provided the basis for design of a
hairpin polyamide, Im-β-ImPy-γ-Im-β-ImPy-β-Dp, which incorporates Im/β pairings to
recognize a "problematic" 5'-GCGC-3' sequence at subnanomolar concentrations.

Table 27 Equilibrium association constants (M^-1) for polyamides.a,b

[The polyamide column of this table consisted of structure drawings that are not recoverable; rows are listed in their original order.]

Polyamide     5'-TGCGCA-3'     5'-TGGCCA-3'     5'-TGGGGA-3'

[drawing]     3.7 × 10^7       <10^7            <10^7
[drawing]     3.7 × 10^9       1.4 × 10^8       1.1 × 10^8

a Values reported are the mean values obtained from a minimum of three
DNase I footprint titration experiments. b The assays were carried out at
22 °C at pH 7.0 in the presence of 10 mM Tris-HCl, 10 mM KCl, 10 mM
MgCl2, and 5 mM CaCl2.

These results identify Im/β and β/Im pairings that respectively discriminate G•C and
C•G from A•T/T•A as well as Py/β and β/Py pairings that discriminate A•T/T•A from
G•C/C•G. These aliphatic/aromatic amino acid pairings will facilitate the design of hairpin
polyamides which recognize both a larger binding site size as well as a more diverse sequence
repertoire.
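
The aliphatic/aromatic pairings identified here can be combined with the aromatic pairing rules in a single lookup. The following is a minimal sketch, not part of the original disclosure. The names `PAIR_PREFERENCE`, `is_match` and `GCGC_PAIRINGS` are hypothetical, and the assignment of ring pairs for Im-β-ImPy-γ-Im-β-ImPy-β-Dp assumes the antiparallel hairpin pairing in which the first residue pairs with the last, the second with the next to last, and so on.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

# Base pairs each side-by-side pairing is reported to prefer in this document.
PAIR_PREFERENCE = {
    ("Im", "Py"): {"G•C"},
    ("Py", "Im"): {"C•G"},
    ("Hp", "Py"): {"T•A"},
    ("Py", "Hp"): {"A•T"},
    ("Im", "β"): {"G•C"},
    ("β", "Im"): {"C•G"},
    ("Py", "β"): {"A•T", "T•A"},
    ("β", "Py"): {"A•T", "T•A"},
}


def is_match(pairings, core: str) -> bool:
    """True if every pairing prefers the base pair at its position in core."""
    if len(pairings) != len(core):
        raise ValueError("one ring pairing is needed per base pair")
    return all(
        f"{base}•{COMPLEMENT[base]}" in PAIR_PREFERENCE[pair]
        for pair, base in zip(pairings, core)
    )


# Assumed pairings for Im-β-ImPy-γ-Im-β-ImPy: Im/Py, β/Im, Im/β, Py/Im.
GCGC_PAIRINGS = [("Im", "Py"), ("β", "Im"), ("Im", "β"), ("Py", "Im")]

if __name__ == "__main__":
    for core in ("GCGC", "GGCC", "GGGG"):
        print(core, is_match(GCGC_PAIRINGS, core))
```

Under these assumptions only the 5'-GCGC-3' core satisfies every pairing, which is in line with the preference for 5'-TGCGCA-3' over 5'-TGGCCA-3' and 5'-TGGGGA-3' reported in Table 27.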

EXAMPLE 10: 
POLYAMIDE BIOTIN CONJUGATES 

Bifunctional conjugates prepared between sequence specific DNA binding polyamides 
and biotin are useful for a variety of applications. First, such compounds can be readily attached 
to a variety of matrices through the strong interaction of biotin with the protein streptavidin.
Readily available streptavidin-derivatized matrices include magnetic beads for separations as
well as resins for chromatography. 

A number of such polyamide-biotin conjugates have been synthesized by solid phase 
synthetic methods outlined in detail above. Following resin cleavage with a variety of diamines, 
the polyamides were reacted with various biotin carboxylic acid derivatives to yield 
bifunctional conjugates. The bifunctional conjugates were purified by HPLC and characterized 
by MALDI-TOF mass spectroscopy and ¹H NMR.

The scheme for the synthesis of an exemplary biotin-polyamide conjugate is shown below.




The foregoing is intended to be illustrative of the present invention, but not limiting. 
Numerous variations and modifications of the present invention may be effected without 
departing from the true spirit and scope of the invention. 
1. In a polyamide having at least three consecutive carboxamide pairs for
binding to at least three DNA base pairs in the minor groove of a duplex
DNA sequence having at least one A•T or T•A DNA base pair, the
improvement comprising selecting a Hp/Py carboxamide pair to
correspond to a T•A base pair in the minor groove of the duplex DNA
sequence or selecting a Py/Hp carboxamide pair to bind to an A•T DNA
base pair in the minor groove of the duplex DNA sequence.

2. The polyamide of claim 1 wherein at least four consecutive carboxamide 
pairs bind to at least four DNA base pairs. 

3. The polyamide of claim 1 wherein at least five consecutive carboxamide 
pairs bind to at least five DNA base pairs. 

4. The polyamide of claim 1 wherein at least six consecutive carboxamide 
pairs bind to at least six DNA base pairs. 

5. The polyamide of claim 1 wherein the A•T or T•A base pair has a G•C
or C•G base pair on either side.

6. The polyamide of claim 1 wherein the duplex DNA sequence is a 
regulatory sequence. 

7. The polyamide of claim 1 wherein the duplex DNA sequence is a 
promoter sequence. 

8. The polyamide of claim 1 wherein the duplex DNA sequence is a coding 
sequence. 

9. The polyamide of claim 1 wherein the duplex DNA sequence is a non- 
coding sequence. 

10. The polyamide of claim 1 wherein the binding of the carboxamide pairs 
to the DNA base pairs modulates the expression of a gene. 
11. A composition comprising an effective amount of the polyamide of claim
1 and a pharmacologically suitable excipient.

12. A diagnostic kit comprising the polyamide of claim 1.

13. A polyamide according to claim 1 having the formula:

X1X2X3X4-γ-X5X6X7X8
wherein γ is -NH-CH2-CH2-CH2-CONH- hairpin linkage derived from
γ-aminobutyric acid or a chiral hairpin linkage derived from R-2,4-
diaminobutyric acid; X4/X5, X3/X6, X2/X7, and X1/X8 represent
carboxamide binding pairs which bind the DNA base pairs wherein at
least one binding pair is Hp/Py or Py/Hp and the other binding pairs are
selected from Py/Im and Im/Py to correspond to the DNA base pair in the
minor groove to be bound.

14. The polyamide of claim 13 wherein there is at least one β-alanine in a
non-Hp containing binding pair.

15. The polyamide of claim 13 wherein dimethylaminopropylamide is
covalently bound to X1 or X8.

16. A polyamide selected from those listed in Tables 9-24 as 
compounds 1 through 240. 

17. A polyamide selected from those shown in Fig. 4.
[Drawing sheets 1/17 and 2/17: figure content not recoverable. One legible panel label on sheet 2/17 reads "Py/Py with A•T".]